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VO (54) Title: IDENTIHCATION OF OVARL\N CANCER TUMOR MARKERS AND THERAPEUTIC TARGETS 

2 (57) Abstract: The present disclosure provides methods for classifying ovarian tumors into BRCAI-type, BRCA2-type or non-BRC A- 
type tumor types by measuring expression levels of a plurality of disclosed ovarian tumor markers. The markers disclosed herein 
^ are useful in the diagnosis, staging, detection, and/or treatment of ovarian cancer. Also provided are methods of selecting a treat- 
^ ment regimen by selecting the tumor type. Ovarian cancer-linked logarithmic expression ratios and kits for diagnosis, staging, and 
^ detection of ovarian cancer using are also provided. 
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IDENTIFICATION OF OVARIAN CANCER TUMOR MARKERS 
AND THERAPEUTIC TARGETS 



PRIORITY CLAIM 

5 This application claims the benefit of U.S. Provisional Application No. 60/357,03 1, filed 

Febiuaiy 13, 2002, which is incorporated by reference in its entirety herein. 

FIELD OF THE DISCLOSURE 
The present disclosure is related to diagnosing, prognosing, staging, preventing, and treating 
10 disease, particularly ovarian cancer. 



BACKGROUND 

Ovarian cancer has one of the highest mortality rates of all cancers, due in part to the 
difficulty of diagnosis. Currently, epithelial ovarian cancer is the leading cause of death resulting 
15 from gynecological cancer (see Welsh et al, PNAS 98: 1176-1 181, 2001). Studies indicate that the 
five-year survival rates for ovarian cancer are as follows: Stage I (93%), Stage n (70%), Stage HI 
(37%), and Stage IV (25%) (see Holschneider & Berek, Seniin, Surg. Oncol 19: 3-10, 2000). Thus, 
there is a particular need for in^roved methods of early diagnosis, prognosis, and monitoring of 
ovarian cancer. 

20 Protein and mRNA levels, and changes in these levels, may be associated with specific types 

of cancer (and cancer progression). Such association is often specific to the type of cancer, meaTiing 
diat what is overe3q)ressed in one cancer may be under-e^^ressed (or unchanged) in anotiher. Thus, a 
collection or set of genes^roteins that are differentially regulated in a specific cancer may be 
indicative and specifically diagnostic of that type of cancer. 

25 .Moleculafmechahisms involved in the onset an^progres^^^ 

poorly understood. However, some mutations causing ovarian cancer have been identified. Between 
S% and 10% of all breast cancers are hereditary. The remaining 90% to 95% are classified as 
"sporadic," for which no genetic link to development has been identified. 

Breast cancer susceptibility genes BRCAl (GenBank Accession # U14680) and BRCA2 

30 (GenBank Accession # U43746) are tumor suppressor genes. Germ-line mutations of BRCAl and 
BRCA2 are responsible for approximately 5-10% of all epithelial ovarian cancers (see Li and Karlan, 
Curr, Oncol Rep. 3:27-32, 2001). Of inherited breast cancers, it is believed that inherited mutations 
in BRCAl or BRCA2 are responsible as many as 70% of all cases. 

Those with inherited mutations in BRCAl and BRCA2 have an approximately 63% lifetime 

35 risk of developing breast cancer, whereas the general female population has a 12% lifetime risk. The 
BRCAl and BRCA2 gene mutations are more often identified in breast cancer patients with poor 
prognostic &ctors, which are risk factors that predict for poorer treatment outcomes (e.g., estrogen- 
receptor-negative tumors, higher growth rates, age less than 35 at onset of disease, breast cancer in 
bodi breasts). Development of disease in the opposite breast and ovarian cancer also appear to be 
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more common in breast cancer patients with BRCA 1 or BRCA2 mutations. Hence, the presence of 
BRCAI or BRCA2 mutations may indicate a need for more aggressive therapeutic treatments. 

The alleles of BRCAI and BRCA2 must be inactivated before tumor developmient occurs. 
BRCAI mdBRCA2 are believed to take part m a common pathway involved in maintenance of 
5 genomic integrity in cells; however, litfle is known about the specific molecular mechanisms 

involved in BRCA mutation associated (BRCA-linked) ovarian carcinogenesis. For exanqjle, it is not 
known whether BRCAI and BRCA2 mutations affect common or unique molecular pathways in 
ovarian cancer, or if tiiese pafliways overlap with those involved in the formation of sporadic tumors. 
Bofli BRCA proteins have been inplicated in iaspofrtant cellular functions, includmg enibryonic 

10 development, DNA damage repair, and transcriptional regulation (see Scully and Livingston, Nature 
408:429-432, 2000; Zheng et c/.. Oncogene 19:6159-6175, 2000; Welcsh et al. Trends. Genet. 
16:69-74, 2000; and MacLachlan et al, J. Biol Ckem. 275:2777-2785, 2000). BRCAI miBRCA2 
have each been implicated in defective homologous recombination DNA repair (see Arvanitis et al, 
Mtemational Journal of Molecular Medicine 10:55-63, 2002), and it is believed that each may be a 

15 positive regulator of homologous recombination, with BRCA2 potentially interacting with Rad5 1 , a 
central homologous recombination effector protein, BRCAI regulating GADD45, a DNA dairage 
response gene. 

Patients having cervical and endometrial cancer resulting in defects in homologous 
recombination pathways have been shown to respond favorably to radiotherapy (Arvantis et al), 
20 Therefore, patients having ovarian cancer resulting fix)m a defect in BRCAI or BRCA2 may similarly 
benefit from radiotherapy treatment Hence, the ability to classify ovarian cancer patients into groups 
based upon the underlying mutation provides advantages in selecting potential courses of treatment, 
and in deciding whether to pursue a more aggressive course of treatment. 

In sum, there is a need to better understand patterns of gene ejqpression that trigger ovarian 
25 cancer, as well as downstream genes that may serve as indicators of ovarian cancer progression or as 
potential tumor suppressors. 

BRIEF SUMMARY OF THE DISCLOSURE 

30 The present disclosure concerns a method of classifying an ovarian tumor as a BRCAl-like 

or BRCA2-lite or non-BRCA-type tumor, by determining a pattern of e3q)iession in the ovarian 
tumor of a pluraUty of markers listed in Table 1, wherein flie pattern of expression in the ovarian 
tumor is determined relative to a standard ovarian tissue. The pattem of expression of the markers in 
the ovarian tumor is then compared to the pattem of expression of the same markers in tissue from a 

35 known BRCAl-like or BRCA2-like or non-BRCA-type tumor. A similarity of the pattem of 
expression in the ovarian tumor to a pattem of expression of the comparison tissue of the known 
BRCAl-like tumor classifies the ovarian tumor as a BRCAl-like tumor; a similarity of the pattem of 
expression in the ovarian tumor to a pattem of expression of the known BRCA2-like ttunor classifies 
the ovarian tumor as a BRCA2-like ttunor; and a similarity of the pattem of expression in the ovarian 
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tumor to a pattern of expression of the known sporadic tumor classifies the ovarian tumor as a 
sporadic tumor. 

The patterns of expression are determined, for example, by determining a pattern of over- 
expression or imder-expression of the plurality of markers in the ovarian tumor to over-eTcpression or 
5 under-expression of the plurality of markers of Hie conparison tissue. Alternatively, a pattern of both 
over-e3q)ression and under-expression of the plurality of madcers in the ovarian tumor is compared to 
over-expression and under-expression of the plurality of markers in the conaqparison tissue. 

It has also been discovered that ovarian tumors that do not contain a BRCAl or BRCA2 
mutation may be BRCA-l-like or BRCA2-like in that tlie pattern of e3q)ression of the markers is 
10 similar to a tumor having a BRCA-1 or BRCA-2 mutation. Hence tumors that would otherwise be 
considered '*non-BRCA-type" can be classified as BRCA-l-like or BRCA-2-like, which can 
contribute to decisions about treatment and prognosis even in the absence of the mutation. 

Standard ovarian tissue serves as a baseline fi:om which patterns of over expression and 
under expression can be determined. The "standard" ovarian tissue may be, for exanq}le, fi:om an 
15 immortalized ovarian cell, ovarian tissue fi'om a subject not having ovarian cancer, a subject not 
predisposed to developing ovarian cancer, or ovarian tissue from a subject from whom the ovarian 
tumor was obtained at an earlier point in time. It is also possible for the standard tissue to be tumor 
tissue taken from a patient at an earlier point in time, for exanple prior to treatment (for example 
prior to the administration of chenootherapy). However in most instances the "standard" tissue is 
20 ''nonnar* non-tumor ovarian tissue, such as an immortalized ovarian cell line, for exan:q)le an lOSE 
cell line. 

Many different approaches are described in this disclosure for determining the patterns of 
expression, and assessing similarities. In specific exan^les, the patterns of expression are patterns of 
logarithmic expression ratios, hierarchical clustering patterns, or multidimensional scaling patterns. 

25 The patterns maybe con:q)ared visually or statistically to arrive at conclusions regarding similarity of 
the patterns. For exan^le, when a multi-dimensional scaling pattem is used to generate a three- 
dimensional representation of data clusters associated withBRCAl-like, BRCA2-like or non-BRCA- 
like tumors, the position of a data point obtained from the tumor specimen that is being analyzed can 
indicate whether the tumor specimen has a pattem of expression associated with one of these groups. 

30 If the data point from the tumor specimen is present within or closely associated with one of these 
clusters, it is assigned a classification the same as the cluster in which is it contained or with which it 
is associated. 

Another approach to comparing patterns of over expression and under expression is to 
assign different color intensities to standard normal deviation values of the logariflimic expression 
35 ratios. Similarities of color patterns can then be used to arrive at a qualitative assignment of a tumor 
specimen to a classification. In another approach, the logarithmic e?q)ression ratios of the plurality 
noarkers is con9)ared using con^)ound covariate predictor analysis. 

In particular exanq)les discussed herem, a BRCAl-like ovarian tumor is differentiated from 
a non-BRCA-like ovarian tumor by compmng relative logarithmic expression ratios of at least one 
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marker shown in Table 6. In more particular embodiments, the pattern of e3q)ression of all the 
markers in Table 6 (CD72, SLC25A11, LCN2, PSTPIPl, SIAHBPl, UBEl, WAS, IDH2, and PCTKl) 
is compared to the pattern of expression of these same markers in the specimen undergoing 
classification. 

5 In another example, a BRCA2-Vkt ovarian tumor is distinguished from a non-BRCA-like 

ovarian tumor by con^arixig relative logarithmic expression ratios of at least one marker shown in 
Table 7, and in some enibodhnents both of the markers (L0C51 760 and LRPAPl), In yet other 
GxaaaplesBRCAl- anABRCA2-Vik& ovarian tumors are distinguished firomnon-^/^C^-like ovarian 
tumors by comparing relative logarithmic e3q}ression ratios of at least one marker shown m Table 8» 

10 for example PSTPIPl, IDH2, and PCTKl, or all the markers in Table 8. In other exan5)les, a 

BRCAl-Vke ovarian tumor is distinguished from a £/!G12-like ovarian tumor by coniparing relative 
logarithmic expression ratios of at least one marker shown in Table 10, more than one maricer shown 
in Table 10, or all the markers in Table 10. 

The disclosed methods also include selecting a treatment strategy based on classifying the 

15 ovarian tumor as BRCAl-Vkt, BRCA2'VikQ or non-5i?G4-like. For exan5)le, the treatment strategy 
may include selecting a more aggressive treatment regimen for a BRCAlASkQ or BRCA2Ajik& tumor 
(even if the tumor does not contain a BRCAl or BRCA2 mutation). Such treatment regimens can 
include chemotherapy, radio&erapy, or surgical removal of the tumor and/or surrounding tissue. 

In yet other disclosed examples, the expression patterns of a tumor specimen and known 

20 comparison tissue are cono^iared using a database of patterns (for exaiiq)le a database of logarithmic 
expression patterns) associated with BRCA1-Vik&, BRCA2-Vik& or wnrBRCA-Vike ovarian tumors. 
The database can contain, for e-xmsple, expression ratios of the pliurality of markers in standard tissue. 
The patterns of the expression ratios of the plurality of markers of the tumor specimen can then be 
con^ared to &e pattern of egression ratios of the same markers in the standard tissue. 

25 'In some exanq)les, comparisons may be miade just of patterns' Of ovet e^qprtssion, for one^ or 

more markers that is over e7q)ressed as listed in Table 5. Alternatively, coix^arisons maybe made 
just of patterns of under e^jpression. The patterns of e3q)ression may be obtained by using nucleic 
acid sequences of the markers to perform nucleic acid hybridization of specific oligonucleotide 
probes to the nucleic acid sequences. The markers may be amplified prior to performing nucleic acid 

30 hybridization, and expression quantitated to detect a level of differential expression. The markers are 
conveniently provided on an array, such as a cDNA microarray. In one exanq}le the cDNA 
microarray contains at least 50, 100, 200, 400 or more of the markers listed in Table 1. 

The results of tiiese comparisons can be used to diagnose or provide a prognosis of 
progression of ovarian cancer in a subject The patterns of esqnression can also be used to screen for 

35 therapeutic agents for the treatment of ovarian cancer, or monitoring response to therapy in a subject, 
by looking for a return of tiie patterns of expression of the ovarian tumor toward a non-tumor tissue 
pattern. Kits are also provided for perfonrung these analyses, and the kit may include arrays with 
cDNAs of the markers. 
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BRIEF DESCMPTION OF THE FIGURES 
Figure 1 shows the overall expression differences between BRCAl-hke, 5i2C42-like, and 
non-BRCA-]ike ovarian epithelial cancers. Figure 1 A* Multidimensional scaling model based on the 
overall gene expression (6,445 filtered spots. Example 1) iaBRCAlAmked (solid circles), BRCA2' 
5 linked (open circles), and sporadic tumors (asterisks). FIG IB. The magnitude of differences in gene 
e?q)ression between various tumor groups as revealed by the number of genes differentially expressed 
among them using Ihe uniform statistical cutoff P<0.0001 . 

Figure 2 illustrates tiiat ^i;C^/- and £/{C42-discrinciinating genes also segregate sporadic 
ovarian cancers into two groins (j!?i{C^/-like and ^i?C42-like^^ Figure 2A. Hierarchical clustering 

10 of 1 10 non-redundant genes (see Table 9, Addendum) showing significant differential expression 
between ^/{C47-linked and (Bl) and ARCj2-linked (B2) tumors (modified F-test P<0.0001). The 
red and green color mtensities represent standard normal deviation (Z-score) values from the mean 
expression level of each gene (represented as black) across sixty-one tumor samples (Example 1). 
Figure 2 A* is a duplication of Figure 2A, but is printed in grey tones rather than in color. Figure 2B. 

15 Hierarchical clustering of sporadic and MC4-linked tumor samples based on the expression pattern 
of the 1 10 MC4-discriminating patterns of gene expression. The B2-, and C-labeled san^les 
signify5i?Cvl7-liiiked,5i?C42-linked, and sporadic tumors, respectively. Figure 2C. Hierarchical 
clustering of sporadic samples in the absence of BRCA-linked tumors reveals two major clusters 
corresponding to BRCAl-typQ and BRCA2-typG patterns of gene e^qpression. 

20 Figure 3 shows molecular profiles of sixty-one tumors as defined by the genes whose 

expression significantly differentiated BRCAl and BRCA2 tumors (i'<0.0001) (see Example 1, and 
Table 9). The red and green color intensities represent expression levels shown as standard normal 
deviation (Z-score) values from the mean e?q)ression level of each gene (represented as black) across 
sixty-one tumor samples. The genes are numbered consecutively 1-61 in Figure 3A, and 62-1 16 in 

25 Figure 3B. ' Figure 3A' and Figure'3B' are duplications of Figure 3A and Figure 3B, respectively, 
but are printed in grey tones rather than in color. Figure 3C shows the correlation of the designated 
rows to genes and SEQ ID NOs for the molecular profile in Figure 3A and Figure 3D shows the 
conelation of the designated rows to genes and SEQ ID NOs for the molecular profile in Figure 3B. 

Figure 4 shows gene expression differences between BRCA-linked and sporadic tumors. A 

30 rnodified F-test with a statistical significance level of P<0.0001 was used to evaluate genes 
differentially expressed between tumor types. The red and green color intensities represent 
expression levels shown as standard normal deviation (Z-score) values fi:om the mean expression 
level of each gene (represented as black) across all sixty-one tumor samples. Each gene name is 
followed by the corresponding I.MA.G.E. clone number spotted on the array. Figure 4A. Genes 

35 differentially expressed between BRCAl Aicked (B) and sporadic (Q samples. Genes located on 

Xp 1 1 appear in red. Figure 4B. Exan;)les of genes differentially e^qsressed between BRCA2-'&aked 
(B2) and sporadic (C) san:q)les. Figure 4C. Examples of differentially e:!q>ressed genes between the 
combined BRCAl- and £i2C^2-linked groiip (B and B2, respectively) and the sporadic (C) sanq)les. 
FIG 4A-C' is a duplication of Figure 4A-C, but is printed in grey tones rather than in color. 
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Figure 4D BRCAIAmked tumors exhibit significantly higher expression levels (P<0.001) of all six 
genes mapped to Xpll .23 compared to the sporadic cancers. Error bars reflect standard error. 

Figure 5 is a bar graph showing an evaluation of gene expression patterns common to 
BRCAAmksd and sporadic tumors. Figure 5A shows the expression of twenty-five genes that 
5 showed two-fold or greater down-regulation as coii9)ared to the lOSE reference cell line. Figure 5B 
shows the expression of twenty-five genes that showed two-fold or greater up-regulation as conq)ared 
to the lOSE reference cell line. Error bars reflect standard error. (FOSy HE4 and CD24) have been 
previously reported to be ovete?q>ressed in ovarian cancers. Several of the overe?q)ressed genes that 
have been demonstrated to be interferon-responsive are presented in italics. The * symbol denotes 

1 0 immediate-early response genes. 

Figure 6 is a series of bar graphs illustrating semi-quantitative RT-PCR (sqRT-PCR) 
analysis of gene expression confirms Ate cDNA microarray data. Expression patterns of select genes 
were examined using sqRT-PCR in representative BRCAl-\wk&A (bars 1-5), 5i?C42-linked (bars 6- 
10), and sporadic (bars 1 1-15) san^les. The expression level of each gene in the tumor samples was 

15 coii^>ared to those of normal postmenopausal ovary (N) and the reference lOSE cells (R). All data 
has been normalized to p-actin is presented as fold expression conq)ared to the lOSE reference RNA, 
Figure 6A shows results for genes HE4, RSGl, and CD74. Figure 6B shows results for genes 
ZFP36, T0P2A and HLA-DRBl. 

20 BRIEF DESCRIPTION OF THE TABLES 

TABLE 1 (see Addendum) lists 822 ovarian cancer-related nucleic acid molecules that 
show altered egression in ovarian cancer. The nucleic acids are identified by their SEQ ID NO, 
their gene name (if one has been assigned), the I.M A.G.E Clone ID nuniber associated with the 
nucleic acid sequence, the UniGene nuniber (if one has been assigned), and a descnption of liie gene 
25 (if known). Because more than one GenBank Accession Nunober is sometimes provided for a given 
nucleic acid molecule, the Table groups the SEQ ID NO assigned to each GenBank Accession 
Nuniber with nucleic acid molecule. For example, the entry for BCKDHB in Table 1 provides SEQ 
ID NOs: 16-17 (represented by GenBank Accession number AA427739 and GenBank Accession 
number AA434304). Each of the 822 SEQ ID NOs are included in the attached sequence listing. 
30 TABLE 2 catalogs the clinicopathologic features of the tumor san^jles in a study of sixty- 

one cases of pathologically-confirmed epithelial ovarian adenocarcinoma. 

TABLE 3 lists representative gene-specific primer sequences used to anapKfy RNA for 
analysis by semi-quantitative PGR. 

TABLE 4 (see Addendmn) lists markers that were under-expressed in ovarian cancer in a 
35 comparison of ovarian epithelial cancer cells to normal postmenopausal ovarian tissue. 

TABLE 5 (see Addendum) lists markers that were ove-rexpressed in ovarian cancer in a 
con^)arison of ovarian epithelial cancer cells to normal postmenopausal ovarian tissue. 

TABLE 6 (see Addendum) lists markers that were differentially expressed between BRCAI- 
linked and sporadic tunobors in a con^)arison to reference immortalized ovarian epithelial.cells. 
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TABLE 7 (see Addendum) lists markers that were differentially expressed between BRCA2- 
linked and sporadic tumors in a comparison to reference immortalized ovarian epithelial cells. 

TABLE 8 (see Addendum) lists markers that were differentially ejqjressed between 
combined BRCAlAmk^A and By^G42-linked versus sporadic tumors in a con^arison to reference 
S immortalized ovarian epithelial cells. 

TABLE 9 (see Addendum) lists markers fliat were differentially ejqjressed between BRCAI- 
linked and BRCAZ-Hnked tumors in a comparison to reference imnoortalized ovarian epitiielial cells. 

TABLE 10 (see Addendum) lists markers that can be used to segregate BRCAlASks from 
BRCA2'Wss tumor types using compound covariate prediction analysis. 
10 TABLE 11 (see Addendum) lists the results of confound covariate predictor analysis for 

the sixty-one tumors disclosed herein, analyzed using the madcers in Table 10. 

BRIEF DESCMPHON OF THE SEQUENCE LISTING 

The nucleic and amino acid sequences listed in the acconq)anying sequence listing are 
15 shown using standard letter abbreviations for nucleotide bases, and single letter code for amino acids, 
as defined in 37 C.F.R. § 1.822. Only one strand of each nucleic acid sequence is shown, but the 
coiiq)lementary strand is understood as included by any reference to the displayed strand. In the 
accompanying sequence Usting: 

20 SEQ ID NO: 1 is a 63-nucleotide synthetic primer containing a T7 RNA polymerase binding 

site. 

SEQ ID NOs: 2 and 3 are ACTB gene-specific primers used for anq)lification during semi- 
quantitative RT-PCR- 

SEQ ID NOs: 4 and 5 are HE4 gene-specific primers used for amplification during semi- 
25 quantitative RT-PCR. 

SEQ ID NOs: 6 and 7 are ZFP36 gene specific primers used for amplification during semi- 
quantitative RT-PCR. 

SEQ ID NOs: 8 and 9 are RGSl gene specific primers used for an^lification during semi- 
quantitative RT-PCR. 

30 . SEQ ED NOs: 10 and 1 1 are CD74 gene specific primers used for aitplification during semi- 

quantitative RT-PCR. 

SEQ ID NOs: 12 and 13 are T0P2A gene specific primers used for amplification during 
semi-quantitative RT-PCR 

SEQ ID NOs: 14 and 15 are HLA-DRBI gene specific primers used for anqjlification during 
35 semi-quantitative RT-PCR 

SEQ ID NOs: 1 6 through 822 are ovarian cancer-related nucleic acid molecules that show 
altered expression in ovarian cancer. These nucleic acid molecules are listed in Table 1, and Uieir 
sequence information is provided in the attached sequence listing. 
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DETAILED DESCRIPTION 



Abbreviations 



5 



cDNA: compleineBtary DNA 

DNA: deoxyribaimcleic acid 

EUSA: enzyme-lmkediximLunosorbenta 
EST: expressed sequence tag 

LM.A.G.E.: Integrated Molecular Analysis of Genomes and their Expression 



Consortium 



10 



15 



lOSE: immortalized ovarian sur&ce epitiielial ceU lines 

MDS: multidimensional scaling 

PCR: polymerase chain reaction 

RIA: radioimmunoassay 

RNA: ribonucleic acid 

RT-PCR: reverse transcription-polymerase chain reaction 

siKNA: small inhibitory RNA molecule 

sqRT-PCR: semi-quantitative reverse transcription-polymerase chain reaction 

STS: sequence-tagged site , 



20 II, Terms 

Unless otherwise noted, technical terms are used according to conventional usage. 
Definitions of common terms in molecular biology may be found in Benjamin Lewin, Genes V, 
published by Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al (eds.). The 
Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182- 

25 9); and Robert A. Meyers (ed.). Molecular Biology and Biotechnology: a Comprehensive Desk 
Reference, published by VCH PubUshers, Inc., 1995 (ISBN 1-56081-569-8). 

In accordance with the present disclosure, conventional n:iolecular biology, microbiology, 
and recoxnbinant DNA techniques wilhin the skill of the art are used. Such techniques are fully 
explained in the literature (see, eg., Sambrook et al, 1989. Molecular cloning, a laboratory manual 

30 2™^ ed. Cold Spring Harbor Labdratoiy, Cold spriiig Harbor, New York; Glover, 1985, DNA Cloning: 
A practical approach, volumes I and II oligonucleotide synthesis, MRL Press, LTD., Oxford, U.K.; 
Hames and Higgins, 1985, Transcription and translation; Hames and Higgins, 1984, Animal Cell 
Culture; Freshney, 1986, Immobilized Cells And Enzymes, JRL Press; and Perbal, A Practical Guide 
to Molecular Cloning, John Wiley & Sons, New York, 1988). 

35 In order to facilitate review of the various embodiments of the invention, the following 

explanations of specific terms are provided: 

Altered expression or differential expression refers to cTcpression of a nucleic acid (&g., 
mRNA or protein) in a subject or biological sample firom a subject that deviates from that expression 
in a subject or biological san^le from a subject having normal (wild-type) characteristics for the 

40 biological condition associated witii the nucleic acid. Normal expression can be found in a control, a 
standard for a population, etc. For instance, where the altered e3q)re5sion manifests as a diseased 
condition, such as growth of a tumor or neoplasia or ontset of a cancer such as ovarian cancer, 
characteristics of normal expression noight include an individual who is not suffering from the 
condition (e.g., a subject not displaying neoplasia growth or not having ovarian cancer), a population 
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standard of individuals believed not to be suffering from flie disease, etc. For instance, certain altered 
expression (such as altered expression of eiBRCA nucleic acid), can be described as being associated 
with the biological conditions of altered (eg., over-expressed or under-expressed) nucleic acid 
e)q>ression and a tendency to develop gynecological cancer, such as ovarian cancer. Likewise, 
5 altered ejqnression may be associated with a disease. The term "associated with" includes an 
increased risk of developing the disease. 

Controls or standards (e.g., a reference cell line, such as immortalized epiflielial ovarian 
cells) for comparison to a sample (e.g., an ovarian cancer tumor), for tiie determination of altered 
e3q)ression, include sainples believed to be normal for the studied characteristic, as well as laboratory 

10 values, even though possibly arbitrarily set, keeping in mmA that such values may vary from 
laboratory to laboratory. Laboratory standards and values may be set based on a known or 
determined population value and may be supplied in the format of a graph or table that permits easy 
comparison of measiured, experimentally determined values. 

When used in reference to a nucleic acid, amplification includes techniques that increase 

15 the number of copies of a nucleic acid molecule in a sample or specimen. An example of 

amplification is the polymerase chain reaction, in which a biological san5)le collected from a subject 
is contacted with a pair of oligonucleotide primers, under conditions that allow for the hybridization 
of the primers to nucleic acid iQinphtQ in the sanq>le. The primers are extended imder suitable 
conditions, dissociated from the tenq)late, and then re-annealed, extended, and dissociated to anq)lify 

20 the number of copies of the nucleic acid. The product of in vitro an:5)lification can be characterized 
by electrophoresis, restriction endonuclease cleavage patterns, oligonucleotide hybridization or 
ligation, and/or nucleic acid sequencmg, using standard techniques. Other examples of m vitro 
anoplification techniques include strand displacement annplification (see U.S. Patent No. 5,744,3 1 1); 
transcription-free isothermal an5>lification {see U.S. Patent No. 6,033,881); repair chain reaction 

25 amplification (see WO 90/01069); ligase chaiii reaction ainplification (see EP-A-320 308); gap filling 
ligase chain reaction amplification (see U.S. Patent No. 5,427,930); coiq>led ligase detection and PGR 
(see U.S. Patent No. 6,027,889); andNASBA™ RNA transcription-free anplification (see U.S. 
Patent No. 6,025,134). 

An array is an arrangement of molecules, particularly biological macromolecules (such as 

30 polypeptides or nucleic acids) or cell or tissue samples, in addressable locations on or in a substrate. 
The array may be regular (arranged in uniform rows and columns, for instance) or irregular. The 
number of addressable locations on the array can vary, for exan^le from a few (such as three) to 
more than 50, 100, 200, 500, 1000, 10,000, or more. A microarray is an array that is mmiaturized so 
as to require or be aided by microscopic examination for evaluation or analysis. A cDNA 

35 microarray is an array of multq>le cDNA molecules, fixed in addressable locations, to which 
complementary nucleic acids in appKed san^les may hybridize (see Hegde et al, Biotechnigues 
29(3): 548-562, 2000). cDNA microarrays of flie disclosure provide for qualitative and quantitative 
analysis of gene expression of the molecules contained in the array. 
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Within an array, each arrayed sample (feature) is addressable, in that its location can be 
reliably and consistently determined within the at least two dimensions of the array. Thus, in ordered 
arrays the location of each sample is assigned to the san:5)le at the time when it is applied to the array, 
and a key may be provided in order to correlate each location with the appropriate target or feature 
5 position. Often, ordered arrays are arranged in a symmetrical grid pattern, but samples could be 
arranged in oflier patterns in radially distributed lines, spiral lines, or ordered clusters). 
Addressable arrays usually are conq)uter readable, in that a conoqputer can be programmed to correlate 
a particular address on the array with information about the sanople at that position (e.g., expression 
data, including for instance signal intensity as well as the identity of the sample). In some examples 

10 of con9)uter readable formats, the individual features in die array are arranged regularly, for instance 
in a Cartesian grid pattern, which can be correlated to address information by a corrqputer. 

The sarnple application location on an array (&e "^feature") may assimoie many different 
shapes. Thus, though the term "spot" may be used herein, it refers generally to a localized placement 
of molecules or tissue or cells, and is not limited to a round or substantially round regioiL For 

15 instance, substantially square regions of appUcation can be used with arrays encompassed herein, as 
can be regions that are, for exanq^le, substantially rectangular, triangular, oval, irregular, or another 
shape. Within a single array, feature shapes do not usually vary, though they will in some 
embodiments. 

In certain example arrays, one or more features will occur on the array a plurality of times 

20 (e.g,^ twice) to provide internal controls, 

A biological sanq>le is any sample in which the presence of a protein and/or ongoing 
eTqpression of a protein may be detected. Suitable biological san^les include samples containing 
genomic DNA or RNA (including mRNA), obtained from body cells of a subject, such as but not 
limited to those present in peripheral blood, urine, saliva, cells obtained by pap smear, sera, tissue 

25 biopsy, surgicd specimeii, amniocentesis saicples and autopsy iiiaterial. 

A BRCAl-Vikd tumor is a tumor in which the gene expression pattern is substantially similar 
to the gene expression pattern in a tumor from a subject who has a mutation in BRCAL Similarly, a 
BRCA2'^\i^ tumor is a tumor m which the gene expression pattern is substantially similar to the 
gene e3q)ression pattern in a tumor from a subject who has a mutation in BRCA2, As described 

30 herein, sporadic tumors may share gene expression patterns with BRCA-Unked and or BRCA24inked 
tumors. Hence, sporadic and other tumors (such as tumors for which no BRCA genetic test has been 
conducted) that have gene expression patterns sunilar to a BRCAl-linked tumor are **BRCAl-hke*' 
tumors. 

A cancer is a biological condition in which a malignant tumor or other neoplasm has 
35 undergone characteristic anaplasia with loss of differentiation, increased rate of growth, invasion of 
surrounding tissue, and/or which is capable of metastasis. 

The term cancer includes ovarian cancer, such as ovarian epithelial cancer, which originates 
in the ovaries and may manifest as epitfaehal tumors, germ cell tumors, or stromal tumors. Also 
. included are different stages of a single cancer, for instance both primary and recurrent ovarian 



wo 03/068054 



PCT/US03/04688 



11 

cancer, and cancer at any progressive stage, such as Stages I-IV. Ovarian cancer is considered a 
gynecological cancer. 

A subject may be classified into an ovarian cancer stage based upon evaluation of a 
biological sample &om the subject for indices kno\m in the art or disclosed herein as being indicative 

5 of that stage of ovarian cancer. For exaxnplc, a subject may be classified as having a cancer state of 
cancer-free, active ovarian cancer stage I, H, HI, or IV ovarian cancer), or in remission from 
previous ovarian cancer. 

cDNA is a piece of DNA lacking internal, non-coding segments (introns) and regulatory 
sequences that determine tcansctiption. cDNA is generally sjmthesized in the laboratory by reverse 

1 0 transcription from messeng^ RNA extracted from cells. 

Compound covariate prediction analysis is a method of predicting into which of two 
groups a sample will be assinged usmg a given statistical signficance cutoff (e.g., P<0.0005). The 
method creates a multivariate predictor for one of two classes to each sairtple and includes in the 
multivariate predictor only those components (eg., nucleic acids expressing on a cDNA microarray) 

15 that meet the statistical signficance cutoff. The multivariate predictor is a weighted linear 

combination of logarithmic ratios for components that are univariately significant The weigjit 
consists of the univariate t-statistics for conq)aring the classes. 

DNA is a polymer that comprises the genetic material of most living organisms (some 
vimses have genomes conqprising KNA). The repeating units in most natural DNA polymers are four 

20 different nucleotides, each of which comprises one of the four bases, adenine, guanme, cytosine and 
thymine, bound to a deoxyribose sugar to which a phosphate group is attached. Triplets of 
nucleotides (referred to as codons) code for each amino acid in a polypeptide, or for a stop signal 
The term codon is also used for the corresponding (and complementary) sequences of three 
nucleotides m the mRNA into which the DNA sequence is transcribed. 

25 Unless otherwise specified, any reference to a DNA molecule is intended to include the 

reverse complement of that DNA molecule. Except where single-strandedness is required by the text 
herein, DNA molecules, though written to depict only a single strand, encompass both strands of a 
double-stranded DNA molecule. Thus, a reference to flie nucleic acid molecule that encodes a 
specific protein, or a fragment thereof, encompasses both the sense strand and its reverse 

30 complement. Thus, for instance, it is appropriate to generate primers from the reverse complement 
sequence of the disclosed nucleic acid molecules. 

An expressed sequence tag (EST) is a unique stretch of DNA within a codiag region of a 
gene that is useful for identifying full-length genes and serves as a landmark for gene mapping. An 
EST is a sequence tagged site (STS) derived from cDNA. 

35 Expression of a gene is the process by which the coded information of a gene is converted 

iato an operational or non-operational part of a cell, often including the synthesis of a protein. Gene 
expression can be influenced by external signals. For mstance, e^qposure of a cell to a hormone may 
stimulate expression of a hormone-induced gene. Different types of cells may respond differently to 
an identical signal. 
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Expression of a gene also may be regulated in the pathway from DNA to RNA to protein. 
Ways in which regulation occurs include through controls acting on transcription, translation, RNA 
transport and processing, degradation of intermediary molecules such as mRNA, or through 
activation, inactivation or conq)artmentalization or degradation of specific protein molecules after 
5 they have been made. 

Changes in gene e?q>ression may be associated with specific types of cancer (and cancer 
progression). Such association is fairly specific to the type of cancer, and tiius what is overexpressed 
in one cancer may be imderexpressed (or unchanged) in another. 

The egression of several genes may be groiq)ed into an expression pattern or expression 
10 profile. Such patterns or profUesxnay be imique to an individual sample depending iq)on certain 
factors, for instance biological stimuli introduced into the subject from which the sample was taken 
(e.g., a hormone) or ongoing disease within the subject (e.g., ovarian cancer). Thus, a collection or 
set of genes/proteins that are differentially regulated in a specific cancer may be indicative and 
specifically diagnostic of that type of cancer. In addition, specific expression patterns may indicate 
15 particular mutations within the individual that correlate and/or cause the disease, for instance a 
mutation in BRCAl or BRCA2y or may indicate a larger class of disease, such as a BRCAJAikQ or 
BRCA2'likQ cancer. Furthermore, changing the expression patterns of these genes to restore the 
normal state, or bring the condition closer to the normal state in one or more characteristic, may 
constitute a treatment for cancer. 
20 As disclosed herein, the expression pattern of an unknown tumor may be compared to the 

e3q3ression pattern of known BRCAl -linked and BRCA2-linked markers to determine if the 
expression patterns are sufficiently sitnilar to classify the unknown as a BRCAl-like or BRCA2-Iike 
tumor. 

Gene amplification or genomic amplification is an increase in the copy nmnber of a gene 
25 or a fragment or region of a gene or a3S0ciated 5 * or 3 ' region, as conq)ared to the copy nuinber in 
nomoal tissue. An example of a genomic anq)lification is an increase in the copy nuniber of an 
oncogene. A "gene deletion" is a deletion of one or more nucleic acids normally present in a gene 
sequence and, m extreme examples, can include deletions of entire genes or even portions of 
chromosomes. 

30 A gene expression fingerprint (or profile) is a distinct or identifiable pattem of gene 

expression, for instance a pattem of high and low e3q)ression of a defined set of genes or gene- 
indicative nucleic acids such as ESTs; in some instances, as few as one or two genes may provide a 
profile, but often more genes are used m a profile, for instance at least fliree, at least 5, at least 10, at 
least 20, at least 25, or at least 50 or more. Gene expression fingerprints (also referred to as profiles) 

35 can be linked to a tissue or cell type, to a particular stage of normal tissue growth or disease 

progression, or to any other distinct or identifiable condition that iofluences gene expression in a 
predictable way. Gene eTipression fingerprints can include relative as weU as absolute expression 
, levels of specific genes, and often are best viewed in the context of a test saii^>le compared to a 
baseline or control sdxx3ple fingerprint By way of example, a gene cjqjression profile may be read on 
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an array (e.g., a polynucleotide or polypeptide array). Arrays are now well known, and for instance 
gene expression arrays have been previously described in published PCX application number 
W09948916 ("Hypoxia-Inducible Human Genes, Proteins, and Uses Thereof), incorporated herein 
by reference in its entirety. 

5 As disclosed herein, the gene expression proJBle of an unknown tumor may be con:5)ared for 

similarities and differences to the expression profile of a tumor known to express in a BRCA-like 
manner (e.g., a BRCAl-like or BRCA2-like tumor). 

A genomic target sequence is a sequence of nucleotides located in a particular region in the 
human genome that corresponds to one or more specific genetic abnormalities, such as a nucleotide 

10 polymorphism, a deletion, or anq)]ification. The target can be for instance a coding sequence; it can 
also be the non-coding strand that corresponds to a coding sequence. 

Gynecological cancers are cancers of ^ female reproductive system, and include cancers 
of the uterus (e,g., endometrial carcinoma), cervix cervical carcinoma), ovaries (e,g., ovarian 
carcinoma, serous cystadenocarcinoma, mucinous cystadenocarcinoma, endometrioid tumors, 

15 celioblastoma, clear cell carcinoma, unclassified carcinoma, granulosa-lhecal cell tumors, Sertoli- 
Leydig cell tumors, dysgerminoma, malignant teratoma), vulva (e.g., squamous cell carcinoma, 
intraepithelial carcinoma, ade^iocarcinoma, fibrosarcoma, melanoma), vagina (e.g., clear cell 
carcinoma, squamous cell carcinoma, botryoid sarcoma), embryonal rhabdomyosarcoma, and 
&llopian tubes (e.g., carcinoma). 

20 An isolated biological con:q}onent (such as a nucleic acid, peptide or protein) has been 

substantially separated, produced apart from, or purified away from other biological components in 
the cell of the organism in which the conq>onent naturally occurs, i.e., other chromosomal and 
extrachromosomal DNA and RNA, and proteins. Nucleic acids, peptides and proteins tiiat have been 
isolated thus include nucleic acids and proteins purified by standard purification methods. The term 

25 also embraces nucleic acids, peptides and proteins prepared by recombiiiant e3q)ression in a host cell 
as well as chemically synthesized nucleic acids. 

A marker is a diagnostic indicator of disease. A marker may consist of any signal 
indicating the presence of the disease, e.g., a physiological change in the body of a subject or 
increased or decreased levels of a substance such as a protein correlated to the disease. Markers are 

30 often found in body fluid samples from a subject By way of example, prostate specific antigen is a 
tumor marker used to detect progression of prostate cancer. The molecules disclosed herein, for 
instance in Table 1 are usefiil as tumor markers for diagnosing, prognosing, staging, preventing, and 
treating cancerous disease, such as ovarian cancer. 

A mutation includes any change of the DNA sequence wifliin a gene or chromosome. In 

35 some instances, a mutation will alter a characteristic or trait (phenotype), but this is not always the 

case. Types of mutations include base substitution point mutations (e.g., transitions or transversions), 
deletions, and insertions. Missense mutations are those that introduce a diiBferent amino acid into the 
sequence of the encoded protein; nonsense mutations are those that introduce a new stop codon. In 
the case of insertions or deletions, mutations can be in-firame (not changing the firame of fiie overall 



wo 03/068054 



PCT/US03/04688 



14 

sequence) or frame shift mutations, which may result in the misreading of a large nurober of codons 
(and often leads to abnormal termination of the encoded product due to the presence of a stop codon 
in the alternative frame). 

This term specifically encompasses variations that arise through somatic mutation, for 
5 instance those that are found only in disease cells, but not constitutionally, in a given individual. 

Examples of such somaticaUy-acquired variations include the point mutations that frequently result in 
altered function of various genes that are involved in development of cancers. This term also 
encompasses DNA alterations that are present constitutionally, that alter the function of the encoded 
protein in a readily demonstrable manner, and that can be inherited by the children of an affected 

10 individual. In this respect, the term overlaps with "polymorphism," as defined below, but generally 
refers to the subset of constitutional alterations that have arisen wifliin the past few generations in a 
kindred and that are not widely disseminated in a population group. In particular embodiments, die 
term is directed to those constitutional alterations that have major inq)act on the health of affected 
individuals, such as those resulting in onset of a disease such as a gynecological cancer. 

15 An oligonucleotide is a plurality of joined nucleotides joined by native phosphodiester bonds, 

between about 6 and about 300 nucleotides in length. An oligonucleotide analog refers to moieties that 
function similarly to oHgonucleotides but have non-naturally occurring portions. For example, 
oligonucleotide analogs can contain non-naturally occurring portions, such as altered sugar moieties or 
inter-sugar linkages, such as a phosphorothioate ohgodeoxynucleotide. Functional analogs of naturally 

20 occurring polynucleotides can bind to RNA or DNA, and include peptide nucleic acid (PNA) 
molecules. 

Particular oligonucleotides and oligonucleotide analogs can include linear sequences up to 
about 200 nucleotides in lengfli, for exan9)le a sequence (such as DNA or RNA) fliat is at least 6 bases, 
for example at least 8, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100 or even 200 bases long, or from about 6 to 
25 about 50 bases, for example about 10-25 bases, such as 12, 15 or 20 bases. 

A neoplasm is a new and abnormal growfli, particularly a new growth of tissue or cells in 
which the growth is uncontrolled and progressive. A tumor is an exanople of a neoplasiXL 

A non-BRCA-type tumor is a tumor in which the gene expression pattern of the BRCAl- 
linked and BRCA2-linked markers disclosed in Table 1 is not similar to either a BRCAl-like or 
30 BRCA2-like gene expression pattera 

A nucleic acid is a deoxyribonucleotide or ribonucleotide polymer in either single or double 
stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides 
that hybridize to nucleic acids in a manner similar to naturally occurring nucleotides. 

A nucleic add sequence (or polynucleotide) is a DNA or RNA molecule, and includes 
35 polynucleotides encoding foil-length proteins and/or fragments of such full length proteins which can 
function as a therapeutic agent. 

Nucleotide includes, but is not limited to, a monomer that includes a base linked to a sugar, 
such as a pyrimidiae, purine or synthetic analogs tiiereof, or a base linked to an amino acid, as in a 
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peptide nucleic acid (PNA). A nucleotide is one monomer in a polynucleotide. A nucleotide sequence 
refers to the sequence of bases in a polynucleotide. 

A &st nucleic acid sequence is operably linked with a second nucleic acid sequence when die 
first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. 
5 For instance, a promoter is operably linked to a coding sequence if the promoter affects the 

transcription or expression of the coding sequence. Generally, operably linked DNA sequences are 
contiguous and, where necessary to join two protein-coding regions, in the same reading frame. 

An ovarian cancer-related molecule includes nucleic acids (such as DNA or RNA or cDNA) 
and proteins that are altered (for exsanplG by mutation or abnormal expression) in ovarian cancer. 
10 Pharmaceutically acceptable carriers include coispositions and formulations suitable for 

pharmaceutical delivery of the nucleotides and proteins herein disclosed. Martin, Remington 's 
Pharmaceutical Sciences, published by Mack Publishing Co., Easton, PA, 19th Edition, 1995, 
describes conventional pharmaceutically acceptable carriers. 

In general, the nature of the carrier will depend on the particular naode of administration being 
1 5 employed. For instance, parenteral formulations usually comprise injectable fluids that include 

pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced 
salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (e.g., powder, 
pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, 
pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically- 
20 neutral carriers, pharmaceutical compositions to be administered can contain minor amoimts of non- 
toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering 
agents and the like, for example sodium acetate or sorbitan monolaurate. 

Primers are short nucleic acids, preferably DNA oligonucleotides 10 nucleotides or more in 
length, which are annealed to a complementary target DNA strand by nucleic acid hybridization to 
25 form a hybrid between the primer and the target DNA strand, then extended along the target DNA ~ 
strand by a DNA polymerase enzyme. Primer pairs can be used for amplification of a nucleic acid 
sequence, by the polymerase chain reaction (PGR) or other nucleic-acid anplification methods 
known in the art. 

Primers as used in the present disclosure preferably comprise at least 10 nucleotides of the 
30 nucleic acid sequences that are shown to encode specific proteins. In order to enhance specificity, 

longer primers may also be enq)loyed, such as primers that comprise 15, 20, 30, 40, 50, 60, 70, 80, 90 
or 100 consecutive nucleotides of the disclosed nucleic acid sequences. Methods for preparing and 
using probes and primers are described in the references, for example Sambrook et al (1989) 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York; Ausubel et al (1987) 
35 Current Protocols in Molecular Biology^ Greene Publ. Assoc. & Wiley-Intersciences; Innis et al. 
(1990) PCR Protocols, A Guide to Methods Ofid Applications, Innis et al. (Eds.), Academic Press, 
San Diego, CA. PGR primer pairs can be derived from a known sequence, for example, by using 
con5)uter programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute 
for Biomedical Research, Cambridge, MA). 
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When referring to a primer, the term specific for (a target sequence) indicates that the primer 
hybridizes under stringent conditions substantially only to &e target sequence in a given saiqple 
coii9)rising the target sequence. 

A probe conq)rises an isolated nucleic acid attached to a detectable label or other reporter 

5 molecule. Typical labels include radioactive isotopes, enzyme substrates, co-factors, Ugands, 

chemiluminescent or fluorescent agents, haptens, and enzymes. Methods for labeling and guidance in 
the choice of labels appropriate for various purposes are discussed, e.g., Sainbrook et al (In 
Molecular Cloning: A Laboratory Manual^ CSHL, New York, 1989) and Ausubel et al (In Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, 1998). 

10 A protein is a biological molecule e;q)ressed by a gene and coniprised of amino adds. 

A purified molecule is one that has been purified relative to its original environment The 
term "purified" does not require absolute purity; rather, it is intended as a relative term. Thus, for 
exanple, a purified protein preparation is one in which the protein referred to is more pure than the 
protein in its natural enviroimient within a cell or within a production reaction chamber (as 

15 appropriate). Non-limiting exanq)les of purified molecules are those that are 50%, 75%, or 90% 
pure. 

A recombinant nucleic acid is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two otherwise separated segments of sequence. This 
artificial combination is often acconq)lished by chemical synthesis or, more commonly, by die 

20 artificial manipulation of isolated segments of nucleic acids, e,g, by genetic engineering techniques 
such as those described in Sambrook et al, In Molecular Cloning: A Laboratory Manual^ CSHL, 
New York, 1989. The t^m reconibinant includes nucleic acids that have been altered solely by 
deletion of a portion of Ihe nucleic add. For mstance, a plasmid is reconibinant if some portion of 
the naturally occurring plasmid has been deleted. Equally, if the sequence of such a plasmid has been 

25 altered, for exanq>le by a nucleotide substitution (or addition or deletion), that plasmid is said io be 
recombinant 

Sequence identity is the similarity between two nucleic acid sequences, or two ammo acid 
sequences is e7q}ressed in terms of the similarity between the sequences, otherwise referred to as 
sequence identity. Sequence identity is frequently measured in terms of percentage identity (or 

30 similarity or homology); the higher the percentage, the more similar are the two sequences. Methods 
of alignment of sequences for comparison are well-known in the art. Various programs and 
alignment algorithms are described in: Smith and Waterman, J. Tlxeor, Biol 91(2): 379-380, 1981; 
Needleman and Wunsch, / Mol Bio. 48:443-453, 1970; Pearson and Lipman, Metlwds in Molec. 
Biology 24: 307-331, 1988; Higgins and Sharp, Gene IZa^l-l^, 1988; Higgms and Sharp, CABIOS 

35 5:151-153, 1989; Corpet etal. Nucleic Acids Research 16:10881-10890, 1988; Huang et al, 

Computer Applications in Biosciences 8:155-165,1992; and Pearson et al., Meth, Mol Bio, 24: 307- 
331,1994. Altschule/ a/., iVa^ Genet. 6(2): 119-129, 1994 presents a detailed consideration of 
sequence alignment methods and honouology calculations. 
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The NCBI Basic Local Alignment Search Tool (BLAST) (see Altschul et al. J. Mol BioL 215: 
403-410, 1990) is available from several som-ces, including the National Center for Biotechnology 
Information (NCBI, Bethesda, MD) and on the Internet, for use in connection with the sequence 
analysis programs blastp, blastn, blastx, tblastn and tblastx. The Search Tool can be accessed at the 
5 NCBI website, togeflier with a description of how to determine sequence identity using this program. 

Nucleic acid sequences that do not show a high degree of identity can nevertheless encode 
similar ammo acid sequences, due to the degeneracy of ttie genetic code. It is understood that 
changes in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
miolecules that all encode substantially the same protein. 
10 Serial analysis of gene expression (SAG£) is the use of short diagnostic sequence tags to 

allow the quantitative and simultaneous analysis of a large number of transcripts in tissue, as 
described in Velculescu et al. Science 270:484-487, 1995. 

A standard is a reference against which a value (e.g., level of eaqiression of a maricer) can 
be compared. By way of example, a non-cancerous cell Une may be used as a standard for conQ}aring 
1 5 the level of expression of tumor markers in an ovarian tumor sanaple. Non-hmiting examples of 

standards useful with the disclosed methods of analysis of patterns of expression of markers include a 
non-cancerous sample (e.g., normal ovarian tissue), a san5)le from a subject prior to development of a 
cancer or at an earher stage of the cancer, and a cell line (e.g., immortalized ovarian epithelial cells, 
such as lOSE cells) considered to display wild-type expression levels of the markers. In some 
20 embodunents, a reference RNA is arbitrarily chosen, but used consistently in relation to all tumor 
samples. 

A subject is a living multi-cellular vertebrate organisms, a category that includes both 
human and non-human maTmnfllj^. 

A therapeutic agent, as used in a generic sense, is a con5}osition used for treating a subject, 
25 such as a pharmaceutical or prophylactic agent 

A transformed cell is a cell into which has been introduced a nucleic acid molecule by 
molecular biology techniques. As used herein, the term transformation enconqjasses all techniques 
by which a nucleic acid molecule might be introduced into such a cell, including transfection wifli 
viral vectors, transformation with plasmid vectors, and introduction of naked DNA by 
30 electroporation, lipofection, and particle gun acceleration 

Treating a disease includes inhibiting or preventing the partial or full development or 
progression of a disease (eg., ovarian cancer), for example in a person who is known to have a 
predisposition to a disease. An example of a person with a known predisposition is someone having 
a history of breast or ovarian cancer in his or her family, or who has been exposed to factors that 
35 predispose the subject to a condition, such as exposure to radiation. Furthermore, treating a disease 
refers to a therapeutic intervention that ameliorates at least one sign or symptom of a disease or 
pathological condition, or interferes with a pathophysiological process, after the disease or 
pathological condition has begun to develop. 
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In some aspects, a more aggressive treatment may be selected if warranted. By way of 
exanq)le, if a subject is found to have a BRCAl-likQ or 5i?G42-like gene e^qpression pattern, a more 
aggressive treatment, such as chemotherapy, radiotherapy, or surgical removal of fiie affected tissue 
and/or surrounding area maybe selected. 
5 A tumor is an abnormal mass of tissue, or neoplasm that may be eiflier malignant or non- 

malignant. **Tumors of the same tissue type" refers to primary tumors originating in a particular 
organ (such as breast, ovary, bladder or lung). Tumors of the same tissue type may be divided into 
tumor of different sub-types, for exanq>le ovarian carcinomas can be further classified based on 
tumor histology as 

10 adenocarcinoma, serous, endometrial, clear cell or mixed. Tumors may also be classified according 
to a genetic abnormality associated with the development of that type of tumor. By way of exanq>Ie, 
a tumor associated wilfa a defect in tumor suppressor genes BRCAl or BRCA2 is referred to herein as 
a ^^BRCAl' or BJ?C12-linked" tumor. As described herein, a sporadic ovarian tumor is a tumor 
arising for a reason other than a mutation in BRCAl or BRCA2, However, the similarities in the 

1 5 pattern of expression of ovarian cancer markers in sporadic tumors to those in jSi?G4 /-linked and 
BRCA2'\mkQd tumors can be used to classify sporadic tumors into "jBi?G<i-like" or "MG42-like" 
tumors, using the methods of the disclosure. A "non-BRCA-type" tumor is one that has a pattern of 
expression of ovarian cancer markers unlike a BRCAl-like or BRCA2-like tumor. 

A vector is a nucleic acid molecule as introduced into a host cell, thereby producing a 

20 transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in the 
host cell, such as an origin of replication, and may also include one or more therapeutic genes and/or 
selectable niarker genes and other genetic elements known in the art A vector can transduce, 
transform, or infect a cell, thereby causing the cell to express nucleic acids and/or proteins other than 
those native to the cell. A vector optionally includes materials to aid in achieving entry of the nucleic 

25 acidintothecell, such as a viral particle, liposome, protein coating or the like. 

Unless otherwise explained, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. 
The singular terms "a," "an," and "the" include plurals unless the context clearly indicates otherwise. 

30 Similarly, the word "or" is intended to include "and" imless the context clearly indicates otherwise. 
"Comprises" means "includes." It is further to be understood that all base sizes or amino acid sizes, 
and all molecular weight or molecular noass values, given for nucleic acids or polypeptides are 
approximate, and are provided for description. Although methods and materials similar or equivalent 
to those described herein can be used in the practice or testing of the present invention, suitable 

35 methods and materials are described below. All pubUcations, patent applications, patents, and other 
references mentioned herein are incorporated by reference in their entirety. In case of conflict, the 
present specification, including e3q)lanations of terms, will control. In addition, the materials, 
methods, and exan^les are illustrative only and not intended to be limiting. 
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iZT. Description of Several Specific Embodiments 

Provided herein are methods of diagnosing or prognosing development or progression of 
ovarian cancer in a subject, which methods involve detecting altered expression of at least one marker 
5 a nucleic acid molecule such as one listed in Table 1 or genes, cDNAs or other polynucleotide 

molecules comprising one of the listed sequences, or a fragment thereof, or a protein, such as one 
encoded by such a nucleic acid molecule, or fragnnient of such protein). In certain embodiments, 
altered expression is detected in more than marker, for instance in at least 50, at least 100, at least 
200, or at least 400 or more nucleic acid molecules listed in Table 1, or encoded for by a nucleic acid 

10 molecule listed in Table 1 . In certain specific enobodunents, no more than the molecules listed in 
Table 6, Table 7, Table 8, Table 9, Table 10 or Table 1 1 are included in such analysis. 

Additionally provided herein are mediods for the classification of ovarian tiunors as BRCAl- 
like, BRCA2Aike or non-BRCA-like tumors based jspon expression profiles of selected markers. 
Using the expression profile data, multiple types of comparisons can be made to provide qualitative 

IS and quantitative information about the tumor type. Non-limiting examples of such comparisons 
include visual examination of color profiles of hierarchically clustered markers on a cDNA 
microarray, multidimensional scaling to the determine relative distance of the analyzed markers, and 
compoimd covariate prediction analysis to statistically classify a given tumor into one of two classes 
based upon the logarithmic expression ratio of the expression of at least one known classifying 

20 marker. In a specific non-limiting exanqjle, logarithmic expression ratios are generated and used to 
classify tumor types by comparing to markers known to have a logarithmic expression ratio 
associated with BRCAl-^, BRCA2ASks or non-BRCA-like tumors (see Example 4). 

Also encompassed herein are arrays containing two or more disclosed noaikers. Certain of 
such arrays are nucleic acid arrays that contain at least one noarker, for instance at least one or more, 

25 such as 5, 10, 15, 25, 50, 100, 150, 200, 250, 300, 350, 400 or more nucleic acid molecules listed in 
Table 1 (or genes, cDNAs or other polynucleotide molecules conoprising one of the listed sequences, 
or a fragment thereoQ, or a fragment of such proteia, or an antibody specific to such a protein or 
protein fragment. Such arrays can also contain any particular subset of the nucleic acids (or 
corresponding molecules) listed in Tables 1-1 1 or all of those nucleic acids. Certain arrays (as well 

30 as the methods described herein) also may include nucleic acid molecules that are not listed in Table 
1. 

Certain of the enconipassed methods involve measuring an amount of the ovarian cancer- 
related molecule in a sanq)le (such as a serum or tissue sample) derived or taken from the subject, in 
which a difference (for instance, an increase or a decrease) in level of the ovarian cancer-related 
35 molecule relative to a standard such as a sample derived or taken from the subject at an earlier time, 
is diagnostic or prognostic for development or progression of ovarian cancer. 

In some embodiments, altered expression of ovarian cancer-related nucleic acid molecules is 
detected using, for instance, in vitro nucleic acid aixplification and/or nucleic acid hybridization. The 
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results of such detection methods can be quantified, for instance by deternrining the amount of 
hybridization or the amount of aiiQ>lification of the nucleic acid molecules. 

Specific embodiments of methods for detecting altered expression of at least one ovarian 
cancer-related molecule use the arrays disclosed herein. Such arrays may be nucleotide (e.g,, 

5 polynucleotide or cDNA) or protein (e.g., peptide, polypeptide, or antibody) arrays. In such methods, 
an array may be contacted with polynucleotides or polypeptides (respectively) from (or derived from) 
a sample from a subject. The amount and/or position of expression of the subject's polynucleotides 
or polypeptides then can be determined, for instance to produce a gene expression profile for that 
subject. Such gene expression profile can be compared to another gene expression profile, for 

10 instance a control gene e^qpression profile from a subject having a known ovarian cancer-related 
conditioiL Similarly, protein anays can give rise to protein expression profiles. Both protein and 
gene e^qiression profiles can more generally be referred to as expression profiles. Expression profile 
data can be used to generate logarithmic expression ratios for use in conipound covariate prediction 
analysis. 

15 Other embodiments are methods that involve providing nucleic acids from the subject; semi- 

quantitatively amplifying the nucleic acids to form nucleic acid an^Iification products using primers; 
quantifying the amount of the nucleic acid ampHfication products; and conq)aring results to 
expression levels obtained using cDNA microanalysis. The sequence of such primers may be 
selected to bind specifically to a nucleic acid molecule listed in Table 1, or a nucleic acid molecule 

20 represented by those Hsted in Table 1. In specific exan^)les of such methods, the primers are selected 
to wjsphfy a nucleic acid product encoding topoisomerase n {TOP2A) (SEQ ID NO: 448), regulator 
of G-protein signaling 1 (RGSl) (SEQ ID NO: 398), invariant gamma-chain-associated protein 
(CD74) (SEQ ID NO: 89-91), epididynris-specific, whey-acidic protein (HE4) (SEQ ID NO: 60), 
major histoconq>atibility con:?}lex, class II, DRbeta 1 protem (HLA-DRBl) (SEQ ID NO: 87-88), or 

25 zinc finger protein (ZIT36) (SEQ ID NO: 167-168). 

Also enconq)assed are methods of ovarian cancer therapy, in which classification of a tumor 
of a patient into a BRCAl^-VikR^ BRCAl-Wt or non-BRCA-like tumor type aids in the selection of a 
treatment regimen. In some exan:q)les, the treatment selected is specific and tailored for the subject, 
based on the analysis of that subject's profile for one or more ovarian cancer-related molecules. 

30 Other embodiments are kits for classifying tumors into a 5i?C4i-like, BRCA2-Vk.t or non- 

BRCA-like tumor class, which kits may include a binding molecule that selectively binds to the 
marker that is the target of the kit In some exarr^les of such kits where the marker is an ovarian 
cancer-related protein, the binding molecule provided in the kit may be an antibody or antibody ' 
fragment that selectively binds to the target ovarian marker protein. In other examples of such kits 

35 where the ovarian cancer-related marker level is a nucleic acid, the binding molecule provided in the 
kit may be an oligonucleotide capable of hybridizing to the nucleic acid marker molecule. 

Furdier embodiments are methods of screening for a confound useful in treating, reducing, 
or preventing ovarian cancer or development or progression of ovarian cancer. Such methods involve 
detemiining if a test compound alters the gene e7q)ression profile of a subject (or cells of an in vitro 
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assay) so that the profile more closely resembles a wild-type expression profile than it did prior to 
such treatment, and selecting a compoimd tiiat so alters the gene expression profile. In specific 
exanq)les of such methods, the test confound is applied to a test cell. In some of such methods, the 
profile is determined or measured in an array format. 
5 Also encon^assed are compounds selected using the methods described herein, which are 

useful in treating, reducing, or preventing ovarian cancer or development or progression of ovarian 
cancer. 

Also disclosed herein are uses of identified target ovarian cancer-related molecules for the 
development of antibodies, including therapeutic antibodies that affect an ovarian cancer-related 
10 pathway. It is also envisioned that the disclosed ovarian cancer-related molecules can be used as 
vaccines, for instance as "cancer vaccines" to elicit an immune response firom a subject that renders 
the subject more resistant to developmg or progressing durough a stage of ovarian cancer. 

IV. Gene expression profiling of ovarian cancer tumor tissue using disclosed markers 
15 The present disclosure concerns gene expression profiling of ovarian tumor tissue from a 

subject for use in diagnosing, prognosing, staging, preventing, and treating the disease. Measurement 
of expression of genes within a tissue san5>le provides ioformation regarding proteins that may be 
active during cancer mechanisms. Hence, the gene expression profile of tumor tissue may be 
conipared against the profile for known markers for ovarian cancer, such as those disclosed herein 
20 (see Table 1). 

Usmg the gene e3q)ression profile, an ovarian tumor firom a subject may be classified into a 
BRCAJ4Ske, BRCA2-]iks^ or noBrBRCA-iike tumor. Because the prognosis for a patient havmg a 
BRCAl or BRCA2 mutation is poorer than for patient having non-BRCA-like mutations, 
classification of tumors into these groups is helpfiil in selecting treatment strategies and aids a 

25 clinician in deciding whether to employ a more aggressive regimen in treating the patient, for 
instance radiotherapy, chemotherapy, or surgical removal of the a£fected tissue. In addition, 
classification of a sporadic tumor into a 5i?C4i-like or BRCA2AjkG classification may provide 
similar guidance in treating the patient. For exanq)le, a subject who has a BRCAJ-Uke or BRCA2A&G 
sporadic tumor may be treated similarly to a subject who has a BRCAJAmked or BRCA2']mkGd 

30 tumor. The identification of BRCAl- or ^i^C42-like sporadic tumors also allows tumors (or subjects) 
to be selected for specific drug regimens that are particularly effective with the associated mutation 
type. 

The ovarian cancer-linked markers disclosed herein are believed to be usefid as diagnostic 
or prognostic indicators of ^i^C^i-Uke, £i^C42-like and nonrB/^Cl-like ovarian cancer. In addition, 
35 the naarkers are believed to be useful in applications for treating ovarian cancer as the basis of new 
therq)eutic targets, for the development of new anti-cancer therapeutic coii9)ounds, and/or to select 
particularly appropriate existmg treatments. For exanq>le, the expression levels of tiiese markers can 
be examined to monitor the effectiveness of anti-cancer treatments where an increase in or decreased 
level of nucleic acid expression opposite of the ovarian cancer-indicative pattem disclosed herein 
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indicates an effective anti-cancer treatment. Beyond use in generating an expression profile, certain . 
of the identified genes or EST sequences provided herein are believed to have individual use as 
cancer naarkers. 

5 A* Generating gene expression information and logaritlunic expression ratios, 

cDNA microanalysis allows for simultaneous analysis of the e3q)ression of multiple genes 
within various tissue san^les, and is therefore useful in generating gene expression profiles. To 
perform cDNA microarray analysis, KMA is isolated from a subject and cDNA is synthesized from 
the RNA according to standard methods (see Sambrook et al^ Molecular cloning, a laboratory 

10 manual. 2^ ed. Cold Spring Harbor Laboratory, Cold spring Harbor, New York, 1989). Relative 
over-expression of the mRNA in the cancerous tissues can be measured against non-cancerous 
reference baselines (e.g., ovarian tissue &om a subject not having ovarian cancer or an ovarian cell 
line, such as an immortalized ovarian cell Une), to provide a framework for determining normal 
expression versus altered expression (genes that are either overexpressed or undere3q)ressed). 

15 Nucleic acids that are overexpressed may be used as markers for ovarian cancer, while genes that are 
underexpressed may be putative tumor suppressors. 

cDNA microarrays containing 7,651 sequence- verified features were constructed and 
applied to analyze the mRNA expression profile of sixty-one subjects with pathologically-confirmed 
epithelial ovarian adenocarcinoma having matched clinicopathologic features (see Alizadeh et aL^ 

20 Nature 403: 503-51 1, 2000; Perou et al. Nature 406: lAl-lSl, 2000; Bubendorf a/., J Natl Cancer 
Inst, 91(20): 1758-64, l999;Vfdsket al.,Proc Natl Acad Sci. USA 98: 1176-1181,2001). These 
included eighteen cases linked to BRCAl founder mutations, sixteen cases linked to BRCA2 founder 
mutations, and 27 cases negative for any founder mutations (e.g., sporadic ovarian q>ithelial cancer). 
These samples were compared to expression levels of these same features in an immortalized normal 

25 ovarian surface epi±elium cell line (lOSB). Statistical tools, including a modified F-test with ' 

P<0.0001 considered significant (e.g., a 99.99% confidence level) were tiien used to analyze tiie data 
(e.g., to differentiate gene expression profiles associated with ovarian cancer), enabling a 
comprehensive, genomics-based analysis of the mRNA expression profiles of these ovarian cancer 
subjects. 

30 The logarithmic expression ratios for the spots on each array were normalized by subtracting 

the median log ratio for the same array. Data were filtered to exclude spots with size less than 25 
jim, intensity less than two times backgroimd or less than 300 units in both red and green channels, 
and any flagged or missing spots. In addition, any features found to be missing or flagged in greater 
than 10% of the auays were not included in the analysis. Application of these filters resulted in the 

35 inclusion of 6,445 of the total 7,651 features in subsequent analyses. Statistical conparison between 
tumors groups was performed using the "BRB Array Tools" software (developed by Dr. Richard 
Simon and Amy Peng, Biometrics Research Branch, Division of Cancer Treatment and Diagnosis, 
NCI, USA), consisting of a modified F-test with P < 0.0001 (99.99% confidence level) considered 
significant (see £xan9>le 4). This stringent P value is selected in lieu of the Bonferroni conection for 
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multiple comparisons, which was deemed excessively restrictive (see Bland and Altman, BMJ,\ 310: 
170, 1995). See Example 4 for an exan[5)le of how an ovarian tumor is analyzed using the disclosed 
methods. 

In addition to statistical analysis, multidimensional scaling (MDS) and hierarchical 
5 clustering techniques using a correlation metric and average linkage were used for evaluating overall 
gene expression. Using these techniques, a large set of genes and other encoding sequences 
e?qpressed sequence tags, ESTs) have been identified (Table 1), the e^qnression of which varies in 
subjects having ovarian cancer (see Addendum). Other confidence levels could be used to select 
ovarian cancer-related molecules, such as 98%, 95%, 90%, 85%, and so forth (see Jain et al^ IEEE 

10 Transacations on Pattern Analysis and Machine Intelligence 22(1): 4-37, 2000). Molecules 

identified as being linked to ovarian cancer (referred to generally herein as ovarian cancer-related 
molecules) using the methods described herein can be arranged on arrays for use in diagnostic and 
prognostic methods. Specific arrays are contemplated that are constructed using molecules identified 
at differing confidence levels. Specific examples of such arrays include arrays that detect altered 

15 e>q)ression of at least 2, 5, 10, 20, 30, or 50 of these molecules. ^ 

B. Comparison of ovarian epithelial adenocarcinoma cells to immortaUzed ovarian 
surface epithelium cells. 

In a comparison of ovarian epithelial adenocarcinoma cells to immortalized ovarian surface 

20 epithelium cells, the largest contrast in gene expression was observed between BRCAI- and BRCA2- 
linked tumors, with multiple genes showing significant differences in expression levels. This group 
of genes was also able to segregate the sporadic tumors into two major ''BRCAI-likt'' and ^'BRCA2- 
like" subgroups, indicating that J[/{C4-related pathways are also involved in sporadic ovarian 
cancers. In addition, two previously imreported gene expression patterns were noted. First, six of the 

25 genes differentially expressed between Si2G4i-linked arid sporadic tumors'map to Xpl 1 .23 and all 
exhibited higher mean e3q)ression levels in the fi/?C47-linked san^les [WAS (SEQ ID NO: 524-526), 
EBP (SEQ ID NO: 529), SMCILI (SEQ ED NO: 529), PCTKl (SEQ ID NO: 527-528), ARAFl (SEQ 
ID NO: 531-532), and UBEl (SEQ ID NO: 533), see Figure 3]. Second, con^ared to immortalized 
ovarian surface epithelium cells, several interferon-inducible genes were noted to be overexpressed in 

30 die majority of all tumor sanq}Ies [SIATl (SEQ ID NO: 73), TNFSFIO (SEQ ID NO: 104-106), 

ABCBl (SEQ ID NO: 164-166), CP (SEQ ID NO: 83-84), HLA-DRBS (SEQ ID NO: 85-86), HLA- 
DRBI (SEQ ID NOS: 87-88, 100, 101-103), CD74 (SEQ ID NO: 92-93), HLA-DRA (SEQ ID NO: 
94-96), HLA-DPA (SEQ ID NO: 97-99), IFITMl (SEQ ID NOS: 50-51, 52-54), IFITM2 (55-57, 58- 
59), A2M(SEq JD NO: 193-195), GJP3 (68-69), IGKC (SEQ ID NOS: 112-1 14, 115-116), SCYBIO 

35 (SEQ ID NO: 120-121), ColSAl (SEQ ID NO: 141-143), HLA-^B (SEQ ID NO: 154-156), and HLA-C 
(SEQ ID NO: 157-159), see Figure 4]. In terms of overall dijfferential gene e3q)ression, BRCAl and 
5/?G42-linked tumors express genes more different from each oflier than from sporadic (pon-PRCA- 
linked) tumor san^>les. 
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The identified ovarian cancer-related genes represent putative mediators of ovarian cancer, 
and as such are candidate targets for the development of novel therapeutics for the treatment of 
ovarian cancer using conventional techniques. By way of exanq)le, a candidate drug, targeted at 
restoring expression of a gene of the disclosure, could be examined using cDNA microarray analysis 

5 for utility in influencing growth of ovarian cancer cells. Thus, use of cDNA microarray techniques 
for genomics-based discovery of genes variably expressed during ovarian cancer provides for the 
identification of novel therapeutic targets for treatment of ovarian cancer. 

It is contenplated that certain of the ovarian cancer markers identified herein encode or 
correspond to soluble proteins, while others encode or correspond to membrane associated or 

10 menibrane integral proteins, some of which are exposed at least to a certain extent on the exterior of a 
cell in which they are expressed. In some embodiments, those ovarian cancer-related molecules that 
are expressed at or on the surface of a ceQ are selected as tiierapeutic targets, for instance for 
targeting with an antibody-based therapy, which is fecilitated by the access of the ovarian cancer- 
related molecule to the extracellular matrix. These ovarian cancer markers may be described as being 

13 "drug accessible." In addition, such soluble ovarian cancer markers, if secreted, may be detected in a 
blood or serum sample fiom the subject 

C. Comparison of ovarian epithelial cancer ceUs to normal postmenopausal ovarian 

samples. 

20 cDNA microarrays containing 7,600 sequence-verified features were constructed and 

appUed to analyze the mRNA expression profile of 61 subjects with ovarian epithelial cancer as 
compared to two nomoal postmenopausal ovarian samples. 

Gene expression in each sanq)le (normal or tumor) was directly conq>ared to a '^reference 
RKA*' consisting of a mix of nine different human cell lines (breast adenocarcinoma, hepatoblastoma, 

25 cervical adenocarcinoma, testicular embryonal carcmoma, glioblastoma, melanoma, liposarc'oma, 
histiocytic lynaphoma, T cell lymphoblastic leukemia, and plasmacytoma/myeloma, Stratagene, La 
Jolla, CA). The raw gene expression data was used to calculate the logarithmic expression ratio for 
each gene. The logarithmic CTcpression ratios ("log ratio") obtained firom this comparison were then 
normalized and statistically compared to one another, providing for indirect conq)axison of gene 

30 expression in tumors and normal ovarian sairq)Ies. This was accomplished by scoring the magnitude 
of differential e3q>ression of each gene (between normal and cancer sauries) according to the 
formula: 

(average cancer log ratio - average normal loe ratiol = magnitude of differential 
35 (standard deviation cancer + standard deviation normal) expression 

In genes showing a large mean expression difference between normal and cancerous samples, the 
magnitude of differential expression has a greater value, while the intra-group variability in 
expression ratios is low. 
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Genes were then ranked according to the magnitude of differential expression and the 
highest-ranking genes were considered to be tiie best candidates for differentiating nonnal from 
malignant ovarian samples (see Furey et al, Bioinformatics 16(10): 906-14, 2000). 

Using these techniques, a large set of genes and other encoding sequences (eg., ESTs) that 
5 are under-expressed in subjects having ovarian cancer have been identified (see Table 4). These 
under-expressed ovarian cancer markers represent putative tumor suppressors, and as such are 
candidate targets for the development of novel therapeutics for the treatment of ovarian cancer using 
conventional techniques. By way of example, induction of e>q}ression of one or more of these 
markers through tiierapeutic means (fi^g,^ induction by a drug or gene therapy) may inhibit tumor 
1 0 growth and/or increase tumor cell death, for instance through stimulation of apoptotic pathways. 

Furthermore, a large set of genes and other encoding sequences (e.g., ESTs) have been 
identified (see Table 5), the e^qiression of which is overexpressed in subjects having ovarian cancer. 
These overexpressed ovarian cancer markers represent putative mediators of ovarian cancer, and as 
such are candidate targets for the development of novel therapeutics for the treatment of ovarian 
15 cancer using conventional techniques. Over-ejqjression of one or more such markers can also be 
detected in the body (for example using a serum test to detect or monitor progression of ovarian 
cancer). 

In addition, six of the markers identified herein (e.g., WAS (SEQ ID NO: 524-526), PCTKI 
(SEQ ID NO: 527-528), USE J (SEQ ID NO: 533), SMCILI (SEQ ID NO: 529), ARAFI (SEQ ID 
20 NO: 531-532), and^BP (SEQ ID NO: 529)) have all been mapped to chromosome Xp 11 (see 

Exan^le 1). Hence, Ifais chromosome could contain additional genes and ESTs that may be useful as 
n:iarkecs for prognosing, diagnosing and monitoring ovarian cancer. The methods of the disclosure 
can be used to find additional genes and ESTs in this region for use as ovarian cancer markers. 

25 K Methods of Classifying Tumors Into Subgroups 

Disclosed herein are multiple me&ods of classifying tumors into subtypes based iq>on the 
e3q)ression of disclosed ovarian tanm markers (see Table 1). 

A. Comparison of raw expression data 

30 The expression data of one or more ovarian cancer markers can be compared between 

samples and analyzed to detect differences in e;q>ression between the markers. The expression of an 
individual marker can be stated in ratio or "fold" form relative to the expression of the standard. For 
instance, in Table 4, the average logarithmic ratio of the gene expression for the standard C^normal") 
for ITM2A (SEQ ID NO: 202) is 1.145, while the average logarithmic ratio of the gene expression in 

35 cancer cells was -2.036. These numbers can be con^ared to derive a value for the difference in 

ejqpression by calculating the expression ratio of each nimiber, and dividing the e^qpression ratio for 
the average log cancer value by the e3qpression ratio for the average log normal value. Hence: 

Expression ratio of avOTge log normal: 2**"^ = 2.21 1 
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Expression ratio of average log cancer 2"^^^^ = 0.244 

Ratio (cancer to normal) = (0.244)7(2.2 11) = 0.1 10 

5 Thus, IIM2A is under-expressed in cancer by a ratio of 0.1 10 to 1 (z.e, in ovarian cancer tissue, 
nM2A ej^resses at approximately 10% of the expression level seen in wild-type cells). 

Collections of such data can be assembled to provide a gene expression proffle, as discussed 
above. With such proffles, the standard deviation of the expression ratio of each gene can be 

10 measured by obtaining the square root of flie variance of the expression data as described by Jaccard 
and Becker (in Statistics for the Behavioral Sciences, 2"^ ed,, Wadsworfli Publishing Co., Belmont, 
California, 1990) and Myers and Well. (mResearch Design and Statistical Anafysis, University of 
Massachusetts, Amherst, Massachusetts, 1995). 

Further analysis can include a Student's t-test, to determine if the mean expression of two 

15 groups (e.g., BRCAl-Hk^ and non-5i2G4-like, 5ieG42«like and non-j?iiC4-like, etc.) are statistically 
different from each other. 

Due to the range over which genes may express, it may be useful to perform statistical 
analyses using the logarithmic expression value for each marker (see Example 1). However, 
calculations using the- logarithmic expression values may dilute the ability of certain analyses to 

20 determine differences. Hence, it may be useful to employ multiple methods of analysis to ascertain 
relative values in e3q>ression (see Jain et al, IEEE Transacations on Pattern Analysis and Machine 
Intelligence 22(1): 4-37, 2000). 

B. Visual analysis of hierarchical clustering 

25 Methods disclosed herein include hiaarchical clustering analysis of genes with statistically 

significant differential expression between sets of tumor groiq)s. Hierarchical clustering can be used 
to cluster objects (e.g., genes, such as the ovarian cancer markers listed in Table 1) to represent 
relationships among the objects. The relationships are represented, for example by a tree whose 
branch lengths reflect the degree of similarity between tiie objects (see e.g., Figure 2B). 

3^ Optionally, hierarchical clustering can be combined with a graphical representation of the 

primary data by representing each data point with a color that quantitatively and qualitatively reflects 
the original experimental observations. The use of color representations, along with statistical 
organization, provides a graphical display that provides visual information about expression of the 
genes. Hence, the methods disclosed herein can provide visual information regarding degrees of 

35 similarity (e.g., patterns of under-ej5)ression or over-ejqpression) between assessed genes in different 
samples, for instance in samples of BRC47-linked, BRCA2-link£id and sporadic ovarian tumor 
san^les (see Figure 2B). 

At the first iteration, each object is considered to be its own group, and the pair of objects 
with the smallest distance between them is merged into a new group. Each subsequent iteration 
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merges two groups to form a new group, xmtil finally all objects end up merged into a single group. 
The classification tree, or dendrogram, graphically represents the sequence of clusters formed at each 
iteration of merges, as well as the distance between clusters at each merge (here, Figure 2). This 
technique is widely employed to represent gene expression information obtained firom microarray 
5 experiments (see Eisen et al, Proc. Natl Acad, Set U,SA. 95(25): 14863-8, 1998). 

The gene expression data disclosed herein were analyzed by calculating the Pearson 
correlation coefScient to obtain a gene expression similarity metric. To describe, Gi is (log- 
transformed) primary gene expression data for gene G in each tumor sample, represented as variable 
L For any two genes A" and F observed over a series of -A^ tumor samples, a similarity score can be 
10 co^^>uted as follows: 

where 

When Goffset is set to the mean of flie gene expression levels of the tumor sanq>les for gene G, then Oq 
becomes the standard deviation of C, and S{X, Y) is exactly equal to the Pearson correlation 

15 coefficient of the gene expression levels for genes A" and 7. Values of Gojset that are not the average of 
the gene expression levels for gene G are used when there is an assumed unchanged or reference state 
(e.g., the gene is not over-expressed or under-expressed) represented by the value of (to^jc, against 
which changes are to be analyzed; in all of the examples presented here, Gogset is set to 0, 
corresponding to a fluorescence ratio of 1.0. 

20 By way of example, Figures 2A and 2A' demonstrate that expression of the disclosed 

markers can be used to visualize difiTeient tumor types. Hierarchical clustering was performed using 
with a Pearson correlation metric and average linkage were used for evaluating overall gene 
expression for the sixty-one ^/?C47-linked, BRCA2AirikGd and sporadic tumors (see Exair^le 1). 
When applicable, all statistical tests were two-sided 

25 In Figure 2, B2 represents BRCA2AinkGd tumors, and B 1 represents BRCAI-lixik&d tumors. 

The red and green intensities represent standard normal deviation (Z score) values from each 
marker*$ means expression level (represented as black) across the sixty-one tumors samples. Red 
represents increased expression and green represents decreased expression. The differences in gene 
expression can be appreciated by lookiag at the groupings apparent in Figure 2A. The genes in the 

30 left half of the Figure 2A are fi:om 5i2G42-linked tumors and tiie genes in the right half are ifrom 
BRCAI']mkGd tumors. As can be seen with casual observation, gene expression between these two 
tumor groups differs relative to the control (lOSE cells). Specifically, BRCA2-lmkGd tumors contain 
under-expressing genes that correlate to these genes in the upper left and lower right quadrants of 



wo 03/068054 



PCT/US03/04688 



28 

Figure 2A, which are represented as primarily green in color. Furthermore, the genes in the upper 
right and lower left quadrant, which are represented as primarily red in color, correlate to genes that 
are generally over-expressed relative to the control lOSE cells. Hence, hierarchical clustering can be 
used to qualitatively visualize diflEerences in the ejqjression patterns of samples. 

5 

C Multidimensional Scaling 

Multidimensional scaling is a dimension reduction procedure that can be used for 
visualization purposes. Each experiment can be represented by its expression profile, which is a K- 
dimensional vector of log-ratios, where K is ttie nuiriber of clones represented after filtering. The 

10 multidimensional scaling procedure reduces each experimenf s e3q)ression profile from K- 

dimensional space to 3-dimensional space, by attempting to preserve distances between the N 
experiment vectors. The distance metric needs to be specified when using the multidimensional 
scaling tool. First, the N x N distance matrix is computed, which quantifies the relationsh5)s between 
the N experiments in the series of chips. For each of the N vectors in K-dimensional space, the 

1 5 multidimensional scaling procedure finds a vector in 3-dimensional space, such that the N x N 

distance matrix computed in 3-dimensional space approximates the N x N distance matrix computed 
in K-dimensional space. The relationships between the N experiments can then be visualized by 
plotting the N vectors in 3-dimensional space, in which each of the N points represents a single 
experiment. A rotating 3-dimensional visualization tool can be used for discovery of experiment 

20 clusters. 

By way of example, the gene expression data of 6445 filtered genetic elements of the sixty- 
one ovarian tumor sanq)les (see Exan^le 1) was used in multidimensional scaling to generate a 3-D 
diagram for visualization of the respective differences between the e3q)ression patterns of each tumor 
sample. As seen in Figure 1, the data segregate into different areas of the 3-D space based on 

25 similarities in gene expression within flie tumor type. In particular, the jMC47-linked'tumors(^ " 
circles) segregate higher into the cube than the 5/ZC12-lniked tumors (open circles). The sporadic 
tumor samples (asterisks) also fell into higher and lower areas of the cube, indicating that fliey 
segregate into BRCAl-t^Q and MG42-type expression patterns. Thus, multidimensional scaling can 
be used to make a qualitative distinction regarding the expression patterns of these san^les. 

30 Multidimensional scaling can be used to qualitatively assess the expression pattem of an 

unknown tumor type. Expression data for a plurality of 5i?G47-type and BRCA2-t^Q markers is 
generated using the tumor tissue (for instance, on a cDNA microarray) relative to a standard ovarian 
tissue (e.g,, from a subject not having ovarian cancer, immortaUzed ovarian epithelial cells, etc.), and 
logarithmic ratios of the gene e3q)ression data are calculated. To con^jare the pattem of expression of 

35 the plurality of the known BRCAI -typ^ and j9iJC42-type markers to the unknown ovarian smsplc, the 
K-dimensional vectors of the logarithmic expression ratios for all e>qjression data are calculated as 
discussed above. Next, the K-dimensional vectors are plotted in a 3-dimensional space and the layout 
of the data conq)ared. Similar to Figure 1, the unknown ssaaplt data should cluster either near the 
BRCAI-]xk& or MG42-like tumors, or alone (which would indicate fliat it is a non-BRCA-like 
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tumor). Hence, multidiinensional scaling can be used to make a qualitative distinction regarding the 
expression patterns of an xmknown saiiq}les in comparison to known BRCAI-typo and BRCA2'type 
markers. In addition, more than one unknown sample can be used in this analysis. 

5 D. Compound Covariate Predictor Analysis 

Segregation into tumor types can be performed using confound covariate predictor analysis, 

which creates a multivariate predictor for one of two classes to each sample (see Exan^qple 4). 

Markers included in the multivariate predictor are those that are univariately significant at the 

selected significance cutoff (e.g., P<0.0005). The multivariate predictor is a weig^d linear 
10 combination of log-ratios (or log intensities for singfle-channel arrays) for genes that are univariately 

significant. The weigjbit consists of the univariate t-statistics for conqparing the classes, and is 

calculated using the equation: 



15 



CCP = 0|1^;*(x,-nv) 



where t/ = t-value for gene / (see Table 10), X/ = logarithmic ratio of the gene expression (i) in the new 
sample to be classified, and m/ = midpoint between the two classes for gene i (see Table 10). The 
index / runs over all the genes that are significant in the original analysis (i.e, all 62 genes in Table 
10), If the log ratio X/ is missing for gene i in the new saii^le to be classified, then it should be 

20 assigned as m^ for that gene, to cause the result of the calculation to be zero for that gene. If the 

conq)ound covariate predictor value is positive, then the tumor classified as one of the first type (e.^., 
BRCAl-like). If the confound covariate predictor value is negative, then the tumor is classified as 
belonging to the second type (e.g., BRCA2-like). 

A second method of tumor classification using a conq>ound covariate predictor model can be 

25 found in Radmacher et aU "A paradigm for class prediction using gene e;q)ression profiles," found 
on tiie National Cancer Institute Internet website. This publication is expressly incorporated by 
reference hereixL 

In order to determine whether a tumor is classified as BRCAlAiki^ or BRCAl-^kR using a 
single markers in Table 10, die following steps are used: 

30 

1 . Gene egression information is obtained, for instance on a cDNA microarray, using 
the same standard (e.g., lOSE cells) that was used to obtain the marker data. 

2. The gene expression data is converted into a logarithmic ratio using log base 10. 
Hence, a tumor that has a gene expression value for gene KIAA00008 of 0.45 would have a log base 

35 10 ratio of -0.346. 

. 3. The midpoint value of Table 10 is subtracted firom the logarithmic ratio, and 
multiplied by the t-value for KIAA00008 for the data set. Thus, 
[(-0.346) - (-.431)] * (-8.0421) = -0.51. 
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The values for the average logarithmic ratio for MC4 /-linked mdBRCA2Ai3okcd values in 
the data set are then consulted. The obtained value will fall between the midpoint and one of these 
values because genes in which larger values of the logarithmic ratio are assigned to one class {e.g.y 
5jRG47-hnked) will have weights of a value that is more negative with respect to the midpoint value 
5 (eg., -0.56864), whereas genes in which larger values of the logarithmic ratios are assigned to the 
odier class (eg., BRCA2-iisik&d) will have weights of a value that is naore positive with respect to the 
mi(^oint value (eg., -0.29414). Hence, the obtained value, 0.1930, would fall on ^ more negative 
side of this data, and would therefore be classified as a £J{C^2-like data set. 

If diis same analysis is performed using multiple markers, the method remains the same 
1 0 except that the data can be summed prior to performing the analysis. This method is a multivariate 
approach of the compound covaxiate analysis, and can be used to determine whether the pattem of 
expression of an unknown tumor is similar to a BRCAl-like or BRCA2-like pattem of eTqxression. 

Further analysis, such as a "leave-one-out" approach may additionally be eniployed to test 
the ability of the Compound Covariate Predictor to classify the tumors into additional subtypes, such 
15 as resistance to a therapeutic compound. See Radmacher et aL^ "A paradigm for class prediction 
using gene e?q)ression profiles," found on the National Cancer Institute Internet website. 

E. Comparisons using Databases 

Due to the large amount of information associated with the analysis methods disclosed 
20 herein, it may be particularly useful to construct and/or consult databases of information for use in the 
analysis. 

By way of exaxiq)le, the information generated by the methods of the disclosure can be 
stored in databases, such as a database of a plurality of markers known to express difTerently in 
BRCAJ-Vke and BRCA2']jke tumors (eg., Table 9). Such databases noay be made publicly available, 

25 such as the Stanford Microarray Database (maintained by Stanford University, see Sheirlock et al.. 
Nucleic Acids Res,, 29(1): 152*155, 2001). These databases maybe used to store reference data for 
use with the classification methods of the disclosure. In addition, such databases can be used to 
provide information regarding markers of potential use in diagnosing, prognosing, or monitoring 
ovarian cancer, for use by clinicians. 

30 The use of databases to search for stored information is disclosed in U.S. 5,871,697 and 

6,519,583 the methods of which are expressly iacorporated herein. 

VI. Kits for measuring the level or function of ovarian cancer-related molecules. 

The nucleic acid sequences and ESTs disclosed herein can be suppUed in the form of a kit 
35 for use in detection or monitoring ovarian cancer. In such a kit, one or more of the nucleic acid 
sequences and/or ESTs in Table 1 are provided in one or more containers, or in the form of a 
microarray. The kit may also contain reagents for use in preparing a biological sanople of a subject 
for screening with the kit. The container(s) in which the reagent(s) and nadcroarray(s) are supplied 
can be any conventional container ftat is capable of holding the supplied form, for instance, plastic 
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boxes, microjEiige tubes, aiEpoules, or bottles. In some applications, negative controls obtained from 
a subject free from ovarian cancer may be provided in pre-measured (e.g., single use) amounts in 
individual, typically disposable, tubes or equivalent containers. Witb such an arrangement, the 
sample to be tested for the presence of ovarian cancer can be added to the testing container and tested 
5 directly. 

The amount/number of each testing reagent and container suppUed in the kit can be any 
appropriate amount, depending for instance on the market to which tiie product is directed. For 
instance, if the kit is adapted for research or clinical use, the amount of each testing reagent and 
container provided would likely be an amount suflHcient to screen several biological sarcples. Those 

10 of ordinary skill in the art know the amount of testing reagent tibiat is appropriate for use in a single 
contamer. General guidelines may for instance be found in Innis et al (PCR Protocols, A Guide to 
Methods and Applications, Academic Press, Inc., San Diego, CA, 1990), Sainbrook et al (In 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989), and Ausubel et 
al (In Current Protocols in Molecular Biology, Greene Publ, Assoc. and Wiley-Intersciences, 1992). 

15 A kit may include more than two nucleic acid sequences or ESTs, in order to faciUtate 

screening of a larger number of ovarian cancer markers or tumor suppressors. For instance, the 
sequences set forth in Table 1, or a subset of (e.g., 5, 10, 15, 20, 50, 100, 150, 200, 250, 300, 350, 400 
or more) of these sequences, may be provided. By way of example, a provided subset could include 
the markers set forth in Table 6, Table 7, Table 8, Table 9, or Table 10. These sets of sequences are 

20 provided by way of exan5)le only, and are not intended to be limiting examples. 

In some embodiments of the current disclosure, kits may also include the reagents necessary 
to carry out screening reactions, including, for instance, RNA sanq)le preparation reagents, 
appropriate buffers (eg., polymerase buffer), salts (e.g., magnesiimi chloride), and secondary 
detection reagents (e.g., cyanine 5-conjugated dUTP). 

25 Kits may in addition include either labeled or unlabeled sequences for use in detectioii of the 

eTqxression levels. 

Embodiments of the disclosure are illustrated by the following non-limiting Examples. 

EXAMPLE 1 

30 Identification of Genes with Altered Expression in Ovarian Cancer 

This example describes how a first subset of the disclosed ovarian cancer-related nucleic 
acid molecules were identified. These ovarian cancer-related molecules show differences in 
ejqpression in subjects having ovarian cancer connpared to normal ovarian surface epithelial cells and 
35 are classified according to their BRCAA, BRCA-l, and sporadic tumor status. The results of these 
studies have been published in Jazaeri et al, J. Natl Cancer Inst., 94(13): 990-1000, 2002, which is 
incorporated by reference in its entirety hereiiL 
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Methods and Material: 

CHfiicopathoIogic characteristics ofBRCA-linked and sporadic ovarian cancers: Sixty-one 
cases of pathologically-confinned epithelial ovarian adenocarcinoma from the Memorial Sloan- 
Kettering Cancer Center were studied and screened for founder mutations. These included eighteen 
5 cases linked to BRCAl^ sixteen cases linked to BRCA2, and twenty-seven sporadic cases. All patients 
were self-identified as Ashkenazi Jews and after infomoed consent underwent genotyping for germ- 
line founder mutations in BRCAl (185delAG and 5382insC) and BRCA2 (6174delT) (see Boyd et al, 
JAMA: 283: 2260-226S, 2000). Those cases with a BRCA mutation were categorized as having 
hereditary ovarian cancer and those without such a notation as having sporadic ovarian cancer: 

10 Tumor samples: In order to nunimize confounding variables, MC^7-linked» BRCAZ- 

linked, and sporadic tumors of similar stage, grade, and histology were selected fix>mthe sixty-one 
individuals studied [18 BRCAl (185delAG, 5382insC), 16 BRCA2 (6174delt), 27 sporadic tumors). 
The majority of tumors in all three groups were characterized by advanced stage, moderate to high 
grade (grade 2 or 3), and a predominance of serous histology. Hence, the clinicopathologic 

15 parameters of selected samples were well-matched and in agreement with those reported previously 
for these tumors types (see Boyd et a/., JAMA\ 283: 2260-2265, 2000: Ramus et al. Genes Chrom, 
Cancer: 25: 91-96, 1999). 

All tumor samples had been flash frozen, embedded in OCT medium, and stored at -80^ C. 
Isolation of RNA was performed using the RNeasy columns (Qiagen, Valencia, CA) according to the 

20 manufiicturer's instructions. The integrity of RNA was verified by denaturing gel electrophoresis. 
Total RNA was linearly anq)lified usmg a modification of the Eberwine method (see Van Gelder et 
c/., Proc. Natl Acad, Set U.SA. 87: 1663-1667, 1990). Table 2 catalogs the clinicopathologic 
features of the tumor samples studied. 
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Table 2. Clinicopathologic features of tumor samples 


Variable 


^HC^i-linked 


BRCAl-VmkRA 


Sporadic 


Number of samples 


18 


16 


27 


Median Age* (SD) 


50(11) 


60(9) 


69(11) 


Stage 




I 


2(11.1%) 


0 


0 


n 


0 


2 (12.5%) 


3 (11.1%) 


m 


11 (61.1%) 


12 (75%) 


24 (88.9%) 


rv 


5 (27.8 %) 


2(12.5%) 


0 


Grade 




1 


0 


0 


0 


2 


4 (22.2%) 


6 (37.5 %) 


8 (29.6%) 


3 


14 (77.8%) 


7 (43.8%) 


16 (59.3%) 


No. 


0 


3 (18.7%) 


3(11.1%) 


Histology** 




Serous 


9 (50%) 


12 (75%) 


16 (59.3%) 


Endometrioid 


3 (16.7%) 


0 


2 (7.4%i 


Mucinous 


0 


0 


0 


Clear CJell 


2 (11.1%) 


0 


0 


Adenocarcinoma NOS 


3 (16.7%) 


3 (18.8%) 


9 (33.3%) 


other 


1 (5.5%) 


1 (6.2%) 


0 


* F test, P - .0002/ Data are tbe median +/. standard deviation. 

** Chi test P vahie for differences in histology among tumor groups- 0. 17 

NOS = not otherwise specified 



cDNA Microarrays: The cDNA microairays consisted of 7,651 total features representing 
different (non-redundant) genes, and were manufactured at the National Cancer Institute nucroairay 
facility. 

5 Total RNA was reverse-transcribed by using a 63 nucleotide synthetic primer containing the 

T7 RNA polymerase binding site (5 

GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG(T)24-3' (SEQ ID NO: 1). Second 
strand cDNA synthesis (producing double-stranded cDNA) was performed with SNase H, 
Escherichia coli DNA polymerase I, and E, coli DNA ligase (Invitrogen, Carlsbad, CA). After 

10 cDNA was blunt-ended with T4 DNA polymerase (Invitrogen, Carlsbad, CA), it was purified by 
extraction with a mixture of phenol, chlorofomi, and isoamyl alcohol and by precipitation in the 
presence of ammonium acetate and etfaanol. The double-stranded cDNA was flien transcribed using 
T7 polymerase (T7 Megascript Kit, Ambion, Austin, TX), yielding amplified antisense RNA that was 
purified using RNeasy mini-columns (Qiagen, Valencia, CA). Pooled total KNA from two SV40 

15 immortalized ovarian sur&ce epithelial cell-lines (lOSB) was aniplified and used as reference for 
cDNA microarray analysis. 

Four |Ag of amplified RNA was reverse transcribed and directly labeled using cyanine 5- 
conjugated dUTP (tumor RNA) or cyanine 3-conjugated dUTP (lOSE RNA, provided by Dr. Jeff 
Boyd, Memorial Sloan-Kettering). Hybridization was performed in a solution of 5X SSC and 25% 



wo 03/068054 



PCTAJS03/04688 



34 

fonnaniide for 14-16 hours at 42°C. Slides were washed, dried, and seamed using an Axon 4000a 
laser scanner (Axon Instruments, Inc., Union City, CA). 

Imaging and LM.AG,E, Analysis: Fluorescence intensities at tiie immobilized targets were 
measured by using an Axon GenePix Scanner and Genepix Pro 3.0 analysis software (Axon 
5 Instruments, Union City, CA). The raw data were then uploaded to a relational database maintained 
by the Center for Mormation Technology at the National Institutes of Heath. The cDNA clones are 
identified by their Xntegrated Molecular Analysis of Genomes and their Expression Consortium 
(I.MA.G.E.) clone number. 

Amplification of RNA: The first strand of RNA was synthesized, by adding 1-3 ^g of total 

1 0 RNA into a reaction tube (e.g., Eppendori^ or other container of suitable size), adding 1 ^1 T7-(dT)24 
primer (2^g/^l), and bringing to a volume of 20 ^1 with nuclease-fi:ee water. The reaction was 
incubated at 70^C for 10 nmiutes, then spun briefly in a centrifuge and placed on ice. Four ^1 5X first 
strand cDNA buffer was added, tiien 2 jil O.IM DTT, 2|il IQmM dNTP mix (Amersham-Pharmacia, 
Piscataway, NJ 08855-1 327 USA), Ijil RNasin (Promega, Madison, WI 5371 1 USA), and 2 \il 

15 Superscrq)t n. The reaction was mixed well and incubated at 42'*C for 1 hour. The tube was 

centrifiiged briefly, and placed on ice. To synthesize the second strand, 91 ^1 DEPC-treated water 
was added, then 30 ^il second strand bulBfer, 3 ^illO mM dNTP mix, 4 ]il DNA Polymerase I (lOU/ 
^il), 1 ^1 DNA Ligase (lOU/pl), and 1 ^il RNAse H (2U/^1). The final reaction volume equaled 150 
^il. Next, the tube was gentiy tapped to mix, then briefly centrifuged. The tube was incubated at 

20 16°C for two hours, then 2 ^il (lOU) T4 DNA Polymerase was added. The tube was cooled for five 
minutes at 16*C, then tiie reaction stopped with 10 |il of 0.5 M EDTA. Ten \il of IM NaOH were 
added, then the reaction was incubated at 65^C for 10 minutes. Ihs solution was neutralized by 
addition of 25 pi Tris-HQ (pH=7.5). 

Qean Up of Double Stranded cDNA: The Phase Lock Gel (PLG) was pelleted m a 

25 microcentrifuge at maximum speed for 30 seconds, 198 unequal volume) of (25:24:1) Phenol: 

chloroform : isoamyl alcohol (saturated with 10 mM Tris-HCl pH 8.0/1 mM EDTA) was added to tiie 
final DNA synthesis preparation (198(il) to a final volume of 396 ^. The solution was niixed well by 
pipetting up & down vigorously. The entire cDNA-phenol/chloroform mixture was transferred to the 
PLG tube, and microcentrifuged at maximum speed for two minutes. The aqueous supematant was 

30 transferred to a new 1.5 ml tube, and 1 ^1 linear acrylamide was added. 0.5 volumes of 7.5M 

Ammonium Acetate + 2.5 volumes (include the added Ammonium Acetate) of 95% ethanol stored at 
—20 to the sanq)le was added and the solution was vortexed, then centrifuged at maximum speed in a 
microcentrifuge at room temperature for 20 minutes. The supematant was removed and the pellet 
was washed with 0.5 ml of 80% ethanol. The solution was centrifuged at maximum speed for 5 

35 minutes at room tenq)erature. The 80% etiianol was poured off, and the 80% elhanol wash repeated. 
The pellet was air dried for approximately 15 minutes, then resuspended in 16 |il of nuclease-fi:ee 
water. 

In Vitro TYanscription: Using an Ambion T7 Megascrq>t kit.(Ambion, Austin, TX 78744- 
1832, USA), the manufacturer's instructions were followed to create a.40 ^1 reaction {ie,, the 20 ^1 
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standard reaction was doubled and incubated at 3TC for 4-5 hours). The reaction was assembled at 
room temperature using 16^1 of template double-stranded DNA, to avoid tibe precipitation of 
spermidine, which can occur if done on ice. Four ^1 of lOx reaction buffer were added, then 4 \il of 
ATP solution (75mM T7), 4 ^il of CTP solution (75mM T7), 4 ^il of GTP solution (75mM T7), 4 \lI 
5 of UTP solution (75mM T7), 4 jil of Enzyme Mix, and the reaction was mcubated at 37**C for 5-6 
hours. Sixty ^1 nuclease-firee water was added to bring the total volume up to 100 ^I. The RNA was 
"cleaned" using the '"RNA clean-up" protocol provided in the Qiagen RNeasy Mini Handbook, May 
1999, pp. 48-49, Qiagen, Valencia CA. KNA was eluted with 30 \il of nuclease-firee water, and the 
optical density ratio was measured (the san[q)le should have an optical density of greater than 1.8 
10 when measured at 260/280 nanQmeters). The expected yield from this preparation was ten times the 
starting amount of total RNA, and the RNA was then ready for use in generating probe for 
noicroaxrays using total RNA (see below). 

Second Round Amplifications: 0.5-1.0 ^g of aniplified RNA were resuspended in 1 1 ^1 
ultrapure water. 

1 5 First Strand Svnthesis : One jil Random hexamer ( 1 mg/ml) was added and the 

reaction was incubated at 70**C for 10 minutes, then chilled on ice, then allowed to equilibrate at 
room temperature for 10 minutes. Four \il 5X First strand cDNA biiffer, 2 \i[ 0. IM DTT, 2 ^il lOmM 
dNTP mix, 1 \il RNasin were added, and the reaction was mixed incubated at 42°C for 2 minutes. 
Two ^1 Superscript II were added, and the reaction was mixed well and incubated at 42°C for 1 hour. 

20 One ^il RNAse H was added, and the reaction was incubated at 37°C for 20 minutes, then heated to 
95°C for 2 nodnutes to quell the reaction, then chilled on ice. 

Second Strand Synthesis : One fil T7-oligodT primer (0.5 mg/ml) was added, and 
the reaction was incubated at 7(fC for 5 minutes and at 42*^0 for 10 minutes. Then, 91 ^1 DEPC 
treated H2O were added, then 30 ^1 Second strand buffer, 3 ^110 mM dNTP mix, 4 ^1 DNA 

25 Polymerase I (lOU/ jil), 1 \il DNA Ligase (lOU/fil), and 1 \il RNAse H (2U/^1) to alHnal volume of 
150 ^l. The tube was tapped gently to mix, then briefly centrifuged. The reaction was incubated at 
16°C for two hours, flien 2 ^1 (lOU) T4 DNA Polymerase were added, and the reaction was cooled 
for 5 minutes at 16**C, The reaction was stopped with 10 ^l of 0.5 M EDTA, then 10 ^il of IM NaOH 
were added. The reaction was incubated at 65*0 for 1 0 miQUtes, then neutralized by addition of a 

30 solution witii 25 \il Tris-HCl (pH=7.5). The protocol for "Clean Up of Double Stranded cDNA" and 
"In Vitro Transcription" was followed to generate cDNA for use in preparation of the probe for 
microarray hybridization. 

Preparation of Probe and Microarray Hybridization Using Amplified RNA : To prepare the 
probe, reverse transcription labeling reaction mixes were created for each probe containing the 

35 conD5)onent Random Primer (InVitrogen, Carlsbad, California 92008, USA). Three fig/jil in 2^1 were 
added, then 5-6 (ig amplified RNA. The reaction was brought to a final volume of 17 |il with water, 
then incubated at room ten^ for 10 minutes. To each probed, 5X first strand buffer in 8 ^1, 20X 
lowT-dNTP mixin2 jil, 0.1 MDTTm4 jil, RNAsmin 1 jil, Cy-3 or Cy-5 dUTP (NENLife Science, 
Boston, MA 02118-2512 USA) in 4 |xl, and SuperScriptn (GIBCO-BRL, InVitrogen Corporation, 
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Carlsbad, California 92008 USA) enzyme in 2 nl was added. The reaction was incubated at 42°C for 
60 minutes, then 5 ^1 of 500 mM EDTA and 10 jil of IM NaOH were added. The reaction was 
incubated at 65®C for 15 minutes to hydrolyze residual RNA, flien cooled to room temperature. 
Twenty-five 25 jil of 1 M Tris-HQ (pH7.5) was added to neutralize pH. 
5 Probe Cleanup: 500 \il of IX TE were added to a Microcon-YMBO column and the column 

was spun at 13,000 ipm for 5-6 minutes to wash &e column. Membrane integrity was checked by 
looking into the top insert, to confirm that a thm fihn of TE (^-50 fil) covered the membrane. 400^1 
IX TE was added to each of the sanq>le tubes and all contents were transferred to flie washed 
Microcon--YM30 cohunn (Amicon, Millipore Corp., Bedford, Massachusetts). The column was spun 

10 at 13,000 rpm for 5-6 minutes until approximately 50 ^1 was left on the menibraiie. Hie column was 
checked for dye crystals along the edge of the column mendbrane^ which indicated that the probe was 
likely to be good. 450|iil IX TE were added to the column and the column was spun down to ~50^1 
as above. The presence of crystals was confirmed. The Cy-3 labeled probe was placed into a clean 
tube, and the column was spxm at 14,000 rpm for 1 minute to elute the probe. The Cy-3 labeled probe 

15 was added to the Cy-5 labeled probe in the colunon, and approximately 450 \il IX TE was added to 
the column. The column was spun at 13,000 rpm until approximately 13-14 fil of combined probe 
remained on the membrane, which was checked with a pipette. The combined probe was inverted 
into a clean tube, and spun at 14,000 for 1 minute to elute. The probe (14 ^1) was transferred into a 
clean Eppendorf tube and stored at 4 °C until used in the hybridization reaction. 

20 Probe Hybridization: Twenty jil of water were added to each humidifymg well in tibe 

Hybridization Chamber (to maintain humidity). Then, 40 ^1 of prehybridization bufTer (5X SSC, 
0.1% SDS, 1% BSA (Sigma) warmed to 42''C) were placed in the center of the slide and the cover 
slip was placed on the slide, taking care to prevent bubble accumulation beneath the slq). The Tnargin 
clamps on the Hybridization Chamber were finnly attached, and the chamber was incubated at 42*C 

25 for least 1 hour. The slide was washed in distilled water for 2 minutes, followed by isopropanol for 2 
minutes. The slide was dried in a centrifuge (5804R, EppendorQ at 705 rpm ('^TOx g) for 4 minutes, 
then prepared for hybridization as discussed above. The slide was hybridized within 1 hour of the 
prehybridization step. Two 2 ^1 COTl-DNA (Hof&nan La Roche, Nutley, New Jersey 071 10 USA) 
(1 ^g/^il), 2 nl polyA (Sigma) (8-10 jig/jil), and 2 ^1 yeast tRNA (Sigma, Ronkonkoma, NY 1 1779 

30 USA) (4 |ig/|il) were mixed with the probe. Then, the probe was denatured for 1 minute at lOO'C, 
placed briefly on ice to cool the reaction, and spun down in a centrifuge. Twenty jil of 2X 
hybridization buffer (50% formamide, lOX SSC, 0.2% SDS, wanned to 42*C) were added to the 
denatured probe, mixed well (taking care to minimize bubble formation) and kept at 42°C until ready 
to spot on the slide. The hybridization chaniber was prepared as in the prehybridization step with 

35 20^1 of distilled water in each well. The sUdes were placed face-t^ in the chambers, and the probe 
was hybridized with the slide for 14-16 hrs at 42*C. 

Slide Washes: The margin clamps on the Hybridization Chamber were carefiilly removed to 
prevent water from seeping in and contaminating the array. The slide was renooved from flie 
chaniber, held with forceps and the cover slip allowed to fall ofT into the solution containing 2X SSC, 
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0.1% SDS. The slide was washed for 4 minutes in IX SSC, 0.1% SDS, for 4 minutes in 0.2X SSC, 
and for 1 minute in 0.05X SSC. The slide was spun dry in a centrifuge at 705 ipm (approximately 
70x g) for 4 minutes. If water droplets were seen on the slide, it was spin again for anoflier 4 
minutes. Exposure to light was minimized by placing the dried slides in a slide box until ready for 
5 scanning. 

Statistical Analysisx The logarithmic expression ratios for the spots on each array were 
normalized by subtracting the median logarithmic ratio for die same array. Data were filtered to 
exclude spots with size less than 25 fmi, intensity less than 2 times backgroimd or less than 300 units 
in both red and green chaimels, and any poor-quality or missing spots. Iq addition, any features 

10 found to be missing or flagged in greater than 10% of ^e arrays were not included in the analysis. 
Application of these fUters resulted in the inclusion of 6,445 of the total 7,651 features in subsequent 
analyses. Statistical conq>arison between tumors groiq)s was performed using the **BRB Array 
Tools" software (developed by Dr. Richard Simon and Amy Peng, Biometrics Research Branch, 
Division of Cancer Treatment and Diagnosis, NCI, USA). A modified F-test is run on each gene's 

1 5 log-ratio values, and the significance of that gene is determined with P < 0.0001 considered 
significant. This stringent P value is selected in lieu of the Bonferroni correction for multiple 
coniparisons, which was deemed excessively restrictive (Figure 1) (see Bland and Altman, B.M,J.; 
310: 170, 1995). 

Semi-quantitative PCR: Five samples from each tumor group (BRCA-l, BRCA-l, sporadic) 
20 were selected at random. For each sairrple, 3.5 ^g of total RNA was reverse-transcribed using oligo 
dT primers and 400 units of Superscript II reverse transcriptase (Invitrogen, Carlsbad, CA) in the 
presence of all four deoxyribonucleoside 5'-triphosphates (each at 10 mM) (InVitrogen, Carlsbad, 
CA) and 40 units of RNAse inhibitor (Promega, Madison, WI). Reverse transcription was performed 
in a total reaction volume of 40 |il, of which 1 |il was subsequently used for each PCR reaction. 
25 Preliminary experiments were performed to identify optimal c^cle nmnber for each gene. Thirty 
cycles was found to be optimal anplification for all ant^lified RNAs except for HLA-DRBl and 
CD74, which were amplified for 26 cycles. Polymerase chain reaction was performed using the 
GeneAmp PCR kit (PE Applied Biosystems, Foster City, CA) according to the manufacturer's 
instructions. Representative gene specific primer sequences are shown in Table 3: 

30 
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Table 3. Primer pairs for RNA ampIiGcation. 



Gene snecifiG Drimer oair 


SEQ ID NO. 


ACTS 5'-ATGTGGATCAGCAAGCAGGA-3' 
5'-GGTGQCTTTTAGGATGGCAA-3' 


SEQIDN0:2 
SEQ ID NO: 3 


HE4 5'-TTCGGCTTCACCCTAGTCTCA-3" 
5'-AGAGGGAATACAGAGTCCCGAA-3' 


SEQ ID NO: 4 
SEQ ID NO: 5 


2FP36 5'-ACCCTGATGAATATGCCAGCA-3' 
5'-GCrACTTGCTTTTGGAGGGTA-3' 


SEQ ID NO: 6 
SEQ ID NO: 7 


RQSl 5'-GACTCnTATCCCAGGTTCCTCA-3' 
5'-TGACTCCCTGGTnTAAGAGCA-3 ' 


SEQ ID NO: 8 
SEQ ID NO: 9 


CD74 5'-CCAGTCCCCATGTGAGAGCA-3' 
5'-AGCTGATAACAAGCTTGGCTGA-3' 


SEQ ID NO: 10 

SEQ ID NO: 11 


T0P2A 5'-TGTCCCTCCACGAGAAACAGA-3' 
5'-CGTACAGATTTTGCCCGAGGA-3' 


SEQ ID NO: 12 
SEQ ID NO: 13 


HLA-DRBl 5'-GCGAGTTGAGCCTAAGGTGA-3' 
5'-TTGAAGATGAGGCGCTGTCA-3' 


SEQ ID NO: 14 
SEQ ID NO: 15 



Amplified RT-PCR products were visualized on an agarose gel stained with ethidimn bromide. The 
intensity of each band was an indicator of the quantity of DNA, as previously aDoplified by PCR. 
S Thus, fte intensity served as an indirect measure of flie starting amount of the RNA anq)lified from 
the respective gene in each sample. Intensity was quantified using an ultraviolet light source and 
Alpha Imager software (Alpha Innotech Coip, San Leandro, CA). In addition to the above- 
mentioned tumor samples, sqRT-PCR evaluation of selected genes was also performed on the lOSE 
RNA for comparison. 

10 

Results 

Global assessment of eene expression diiferences amons tumor sroups : Prior to 
investigating specific inter-group differences, the overall patterns of gene e^qiression in the three 
tumor types (BRCA-l, BRCA-2, sporadic) were assessed. Multidimensional scaling (MDS), based on 

15 the expression levels of all 6,445 filtered genetic elements in the microarray, revealed that BRCAl- 
and BRCA2'^ijkGd tumors have distinct molecular profiles. In contrast, the sporadic sanq)les showed 
a more heterogeneous distribution pattern, with many patterns clustering near the patterns of BRCAl- 
linked or 5i?G42-lmked san^les (Figure 1 A). The MDS results suggested that Ihe BRCAl- and 
£i!!C42-associated groups would be the most different and that gene expression patterns for each of 

20 the BRCA groups and the sporadic tumors woidd have fewer differences. In support of this 

hypothesis, only a few genes showed statistically significant (P<0.0001) differential expression 
between the sporadic tumors and the BRCAl- or BRCAlAmk&d tumors, whereas 1 10 genes were 
differentially expressed between BjRC4/-liDked and jB/2G42-linked tumors (Figure IB). In addition 
34 EST sequences were differentially expressed between BRCA I- and 5/?G42-liDked tumors. The 

25 group of 144 total markers that were differentially expressed between BRCAl- and BRCA2-typG 
tumor cells compared to normal ovarian epithelial cells is listed in Table 9 (see Addendum). 

Differential gene expression among all three groups was also performed, which identified 60 
genes and 3 EST sequences whose expression segregated jS/{C47-linked, £RG42-linked, and 
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sporadic tumors (modified F test, wherein P<0.0001). Fifty-one of these 63 genes and EST 
sequences were also among the statistically significant discriminators of BRCAl and BRCA2 tumors, 
highlighting the distinct gene expression profiles of these two groups. In addition, the expression 
profile of the combmed BRCAl- and MC42-linked group was remarkably similar to Aat of the 
S sporadic tumors, as demonstrated by only diree genes showing differential e?cpression (PO.OOOl) 
between these groups [PSTPIPl (SEQ ID NO: 538-540), IDH2 (SEQ ID NO: 541-542), miPCTKl 
(SEQ ID NO: 527-528)]. These observations were in agreement with the multiduaaensional scalmg 
analysis and demonstrated that, in terms of the overall pattern of gene expression, the BRCAl- and 
£RC42-linked tumors are distinct from one ano&er. FurOiemQore, the gene expression profiles of the 

10 sporadic tumors ^pear to share features of tiUbisir BRCAl- ox BRCA2'^sik&d cancers, and these 
sporadic tumors are referred to herein as 5/{Ci4/-type or BRCAl-t^t sporadic ovarian tumors. 

The group of 144 nucleic acid molecules Usted in Table 9 was further investigated using 
hierarchical clustering (Figure 2A, B). As expected, the 5i?C4-associated tumors showed distinct 
and contrasting expression profiles (Figure 2A). Strikingly, the sporadic samples also segregated into 

15 two groups based on the expression patterns of the same 144 genes, exhibiting sporadic sample had a 
molecular profile similar to that of eitibier the BRCAl- or the MG42-hnked tumors. This observation 
was illustrated by hierarchical clustering of all samples, revealing distinct *^BRCAl-typG^^ and 
^^BRCA2-typQ" clusters (Figure 2A). This clustering further demonstrates that sporadic tumors 
(which do not contain the BRCAl or BRCA2 mutations) can often be classified as BRCAl -typo or 

20 BRCA2-typQ, Classification of sporadic tumors into these subtypes may provide guidance in treating 
the patient For exan^le, a subject who has a BRCAl -type or BRCA2-typQ sporadic tumor may be 
treated similarly to a subject who has a BRCAl -hiiked or ^i?C42*linked tumor. The identification of 
BRCAl- 0TBRCA2-typG sporadic tumors also allows tumors (or subjects) to be selected for specific 
drug regimens that are particularly effective with the associated mutatioii type. 

25 Color-coding is usually used to represent the relative transcript e^q^ressfon ratio, as me'asured 

by cDNA microarray analysis. Red customarily indicates the maximum point in gene expression, 
green the minimum, and levels closer to the mean approach black 

To ensure that the 5i2C/4-linked sanq)les were not biasing the observed clustering patterns, 
the hierarchical architecture of gene expression in sporadic tumors was examined separately. Even in 

30 the absence of the BRCA-hhked sanq)les, two distinct cluster phenotypes were observed, each 
comprised of those sporadic sanqjles that previously grouped with BRCAl- and Bi?C^2-linked 
tumors (Figure 2B). Tumor histology and patient age were also evaluated for possible confounding 
effects on the observed BRCAl -type and BRCA2-type clusters. Neither variable was found to 
influence clustering patterns (Figure 2A, 2B). 

35 Q&}^^ Mer^fially e3q?re^$ed b^p/^e^n BRQIJI- ^nd fiR,C42-Hnhed oymofi cqrcmomq^: 

The analysis of overall gene expression patterns established that the same genes whose expression 
differentiated BRCAl and BRCA2-^xiksd tumors, also identified two major sub-populations of 
sporadic cancers (Figure 3). As such, these nucleic adds are believed to represent in^ortant 
mediators of common genetic pathways in ovarian cancer and/or carcinogenesis. Many of diese 
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genes are involved in inq^ortant cellular functions including signal transduction, RNA processing and 
translation, chemoldne signaling and immune modification, and DNA repair. By way of example, the 
i?i?G47-associated tumors were characterized by higher AKTl (SEQ ID NO: 504-506) and lower 
PTENiSEX^ ID NO: 507-509) relative expression. In addition UBLl (SEQ ID NO: 510-512) (also 
5 known as SUMOA and sentrin) was more highly expressed in BRCAl- associated tamois. This 
molecule interacts with RAD5 1 and RAD52 and has been proposed to have a regulatory role in 
homologous recombination (see Li et aL, Nuc. Ac, Res: 28: 1 145-1153, 2000). The preferential 
expression of UBLl (SEQ ID NO: 510-512) in Ihe i3i?G4i-Unked samples may prove to be relevant 
to possible differences in DNA repair actions of the BRCA tumor suppressor genes. 
10 By way of example, the J[/iC^2-linked tumors showed higher relative e3q;n:ession of WNT2 

(SEQ ID NO: 513-514 and SFRP4 (SEQ ID NO: 515-517), which are members of flie wnt-p-catenin- 
TCF signaling pathway. Another notable observation is that both BRCAl- and ARG42-linked tumors 
showed preferential expression of proto-oncogenes commonly altered in hematologic malignancies. 
BRCAI tumors showed higher expression levels of RUNXI (SEQ ID NO: 518-520)/AMLI, while 
15 Bi?G42-associated san5)les showed preferential egression of TALI (SEQ ID NO: 521-523)/SCL. 
Both of these oncogenes are transcription factors involved in proliferation, and their preferential 
expression in BRCAl- and 5jRC42"linked tumors may indicate that the activation of such a 
'^proliferation driver" is a necessary step in ovarian carcinogenesis. 

Gene expression differences between BRCA4inked and sporadic tumors: Nine non- 
20 redundant genes showed significant differential expression between £AC4/-linked and sporadic 
tumors [CD72 (SEQ ID NO; 805), SLC25A11 (SEQ ID NO: 544), LCN2 (SEQ ID NO: 545-547), 
PSTPIPl (SEQ ID NO: 538-540). SIAHBPl (SEQ ID NO: 543), UBEl (SEQ ID NO: 533), WAS 
(SEQ ID NO: 524-526), IDH2 (SEQ ID NO: 541-542), PCTKl (SEQ ID NO: 527-528), PO.OOOl, 
Figure 4A . A noteworfliy observation was that three of these genes, WAS (SEQ ID NO: 524-526), 
25 PCTKl (SEQ ID NO: 527-528), and UBEl (SEQ ID NO: 533), have aU been m^ed to the Xpl 1,23 
and all were higher expressed in the BRCAI-Hnk&d tumors. This observation seemed unlikely to be 
explained by chance alone as only 35 of the total 6,445 filtered spots (0.5%) on the microairay 
represent genes mapped to Xpll. To further investigate this pattern, a larger group of 53 genes was 
considered for differential expression between BRCAl AihkQd and sporadic tunaors under the less 
30 stringent significance level of P<0.001. Among this group three additional genes, SMCILI (SEQ ID 
NO: 530), ARAFl (SEQ ID NO: 531-532), and EBP (SEQ ID NO: 529), were discovered that also 
mapped to tiie Xpl 1 ,23 locus and also showed higher mean expression in 5i?C4i-associated samples 
(FIG 4D). Thus, six of fifty-three genes differentially expressed between ARG4/-linked and sporadic 
samples (P<.001) mapped to Xpl 1 .23 and aU showed higher mean e^ipression in jBi{C47-linked 
35 tumors. In silico analysis of the location of these genes revealed that they are all confined to a 5-Mb 
region of DNA m Xpl 1 .23 (Ensemble database, Prous Science, Philadelphia, PA 19102, U.S.A.). 

The comparison between i?/2C/(2-linked and sporadic tumors revealed only two genes with 
differential expression among these groups at the significance level of P<0.0001 (Figure 3). The 
gene designated as L0C51 760 (SEQ ID NO: 534-535) is also known as B/K (encoding the 
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brain/kidney protein) and is moderately homologous to the synaptotagmin family of vesicular 
transport molecules. The second differentially expressed gene encodes low-density lipoprotein- 
related protein-associated protein 1 (LRPAPI), also known as alpha-2-ma<a:oglobulin receptor- ' 
associated protein 1 . 

5 A further conq)arison consisted of investigating gene expression differences between the 

combined jB/{C4-linked group and the sporadic group, which revealed only three non-redxmdant, 
differentially expressed genes [PSTPIPJ (SEQ ID NO: 538-540), IDH2 (SEQ ID NO: 541-542), and 
PCTKl (SEQ ID NO: 527-528), Figure 4C. All three genes were among the group of genes that 
differentiated BRCAI-]mked and sporadic sanq>les. This finding is consistent with the observation 

10 that the RNA profiles of sporadic ovarian cancers share significant similarities with those of BRCAl- 
linked or BRCA2AxsiLtA tumors. It is believed that the similarities shown in flie KNA profiles is a 
general characteristic that applies to gene and protein conq)onent profiles as well. The small nuntiber 
of differentially expressed genes obtained fix>m the con^iarison of the coinbined group to the 
sporadic tumors is the result of the latter also consisting of MC4/-type and£i?Gj|2-type molecular 

15 classes. 

Gene expression features distin^ishim ovarian cancers from ovarian surface epithelial 
cells : Gene expression patterns common among all tumor types were investigated to identify genes 
that may be associated witii the transformed state, Le,, genes commonly e^qjressed in ovarian tumors 
irrespective of their hereditary or sporadic nature. Gene expression in all sixty-one primary tumor 

20 sanq)les was compared to immortalized ovarian surface epithelial (lOSE) cells used as tiie common 
reference. Using the selection criterion of two-fold or, greater expression ratio relative to the lOSE 
reference in at least two-thirds of all tumors, a list of 201 nourredundant genes and ESTs was 
generated. The top twenty-five overexpressed (1L8 (SEQ ID NO: 449-451), GROl (SEQ ID NO: 
452-453), ALDHJA3 (SEQ ID NO: 454-456), MMPl (SEQ ID NO: 457-459), OSF-2 (SEQ ID NO: 

25 460-461), CDC25B (SEQIDNO: 462-464), FLNA (SEQIDNO: 465-467), TFP12 (SEQIDNO: 
468-469), FGF2 (SEQIDNO: 470-472) , CD44 (SEQIDNO: 473-475), DYTl (SEQIDNO: 476- 
477), UCHLl (SEQIDNO: 478), FGF2 (SEQIDNO: 470-472), PLAU (SEQ ID NO: 479-480), 
LDHA (SEQ ID NO: 256), PTGS2 (SEQ ID NO: 481-483), PRNP (SEQ ID NO: 484-486), MTIX 
(SEQ ID NO: 487-488), UGB (SEQ ID NO: 489-490), PBEF (SEQ ID NO: 491-493), TXNRDl (SEQ 

30 ID NO: 494-496), NT5 (SEQ ID NO: 497-499), PTGS2 (SEQ ID NO: 481-483), MT2A (SEQ ID NO: 
500-502), ZNF220 (SEQ ID NO: 503)) and twenty-five down-regulated {FU22174 (SEQ ID NO: 30- 
31), DDRl (SEQ ID NO: 74-76), SERP1NF2 (SEQ ID NO: 18-19), HLA-DRBl (SEQ ID NO: 87-88). 
IFITM2 (SEQIDNOS: 55-57, 58-59), HGF (SEQIDNO: 174-175), SORLl (SEQIDNO: 149-151), 
CP (SEQIDNO: 83-84), HLA-DRA (SEQIDNO: 94-96), BRF2 (SEQIDNO: 190-192), ABCBl 

35 (SEQ ID NO: 164-166), G1P3 (SEQ ID NO: 68-69), RGSl (SEQ ID NO: 122-123), IFITMl (SEQ ID 
NOS: 50-51, 52-54), FOSfSEQIDNO: 133-135), PPP1R7 (179-180), HLA-DPA (SEQIDNO: 97- 
99), HLA-DRB5 (SEQIDNO: 85-86), TLR3 (SEQIDNO: 199-201), ZFP36 (SEQIDNOS: 167-168, 
169-171, 172-173), SGK (SEQIDNO: 176-178), HLA-DRBl (SEQIDNO: 87-88), HE4 (SEQID 
NO: 60), a^74 (SEQ ID NO: 89-91, CD24 (SEQ ID NO: 181-182)) named genes (by order of 
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magnitude) are presented in Figure 4A and 4B. This analysis revealed two potentially significant 
functional groups of genes to be overexpressed in ovarian cancers. The first group consisted of 
several of the genes that have all been previously shown to be interferon-inducible {HLA-DRBl (SEQ 
IDNO: 87-88), HLA-'DRBS (SEQ ID NO: 85-86), HLA-DRA (SEQIDNO: 373-374), HLA-DPA 
(SEQ ID NO: 97-99), CD74 (SEQIDNO: 89-91) JFITMI (SEQIDNOS: 50-51, 52-54), and 
IFITM2 (SEQ ID NOS: 55-57, 58-59), as indicated by itahcs in Figure 4A and 4B). The second 
group consisted of immediate-early response genes (BRF2, ZFP36, SGK, andFQS). In addition, 
several genes previously reported to be overejqaressed in ovarian epithelial tumors were present m the 
list of genes overexpressed in tumors relative to the lOSE cells (Figure 4A and 4B). Elevated levels 
of CLU, CD24, and MUCl were also observed. These results identify additional potential markers of 
ovarian cancer. Table 9 lists the 144 nucleic acids that showed significantly elevated expression in 
ovarian cancer. These genes were selected based on consistency across all the pooled experiments 
and a significant difference in the average expression in the 40 independent samples, using a criteria 
of a tumor-to-ovarian surface epithelial cell line ratio of two or greater in at least 66% of all tumors. 

EXAMPLE 2 

Semiquantitative RT-PCR confirms and complements cDNA microarray data 

This example describes how the results found in the previous example were confirmed using 
20 semiquantitative RT-PCR. 



10 



15 



To validate flie array data, semiquantitative RT-PCR (sqRT-PCR) analysis of several 
mRNAs was performed in a representative subset of tumors consisting of five Ji?G47-Iinked, five 
j8AC^2-linkedy and five sporadic RbTA sanq)Ies. The tumor san:q)les were randonoly selected. The 

25 exi^ession of TOP2A (SEQ ID NO: 448), RGSl (SEQ ID NO: 398, CD74 (SEQ ID NOS: 89-91, 92- 
93), HE4 (SEQ ID NO: 60), HLA-DRBl (SEQ ID NO: 87-88), and ZFP36 (SEQ ID NO: 167-168, 
169-1 7h 1 72-1 73) were evaluated using sqRT-PCR, with P-actin as a normalizing control. Because 
data obtained from cDNA microarrays is in the form of relative expression ratios between tumors and 
the reference, RNA from lOSE cells and a histologically normal, postmenopausal ovarian RNA 

30 sample in the sqRT-PCR e?q)eriments was included for comparison. The results of these sqRT-PCR 
e^qjerunents were consistent with the cDNA microarray relative expression data for all six genes 
evaluated (Figure 5A and 5B). As anticipated from ^e microarray results (Figure 4), HE4 ejqjression 
was consistently elevated in all fifteen tumor samples compared to lOSE reference cell-line and 
normal ovary (Figure 5A and 5B). Invariant chain genes, also known as CD74 and RGSl, were 

35 overesqpressed in the majority of tumors as indicated by noicroarray analysis (Figure 4). Both were 
also found to have increased e;q)ression in the majority of tumors as evaluated by sqRT-PCR (Figure 
5A and 5B). The expression of TOP2A was found to be highest in tiie reference lOSE RNA. 
Furthermore, compared with the expression level in the normal postmenopausal ovary, twelve of the 
fifteen tested tumor samples showed elevated and variable T0P2A gene e}q)tession. 
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Several meiribers of the immediate-early response cascade showed elevated expression in 
tumors as cong)ared to lOSE cells in the microairay experiments (as indicated by the notation * in 
Figure 4A and 4B); however, some of these genes have previously been shown to have lower 
expression in ovarian cancer compared to normal ovary (see Welsh et aL, Proc. Natl Acad, Sci. 
5 aSji.SS: 1176-1181, 2001, andWange^fl/,, Gene 229: 101-108, 1999). 

This discrepancy suggested tiiat the elevated relative ratio observed in these e3q)eriments 
maybe driven by low ejq>ression levels of these genes in lOSE cells grown in culture. In order to test 
this hypothesis, sqRT-PCR was used to compare ZFP36 (an immediate-early gene also known as 
G0S24 and TislJ) e3q)ression in tumors to that of normal ovaiy and lOSE cells. As suspected, 
10 normal ovary had one of the highest e:q)ressiQn levels of ZFP36, followed by that of the majority of 
tumors (Figure 5A), while the lowest e3q)ression level was observed in the lOSE cells. 

In addition to statistical analysis, multidimensional scaling (MDS) and hierarchical 
clustering techniques using a correlation metric and average linkage were used for evaluating overall 
gene e3q>ression (see Eisen et ah, Proc. Natl Acad, Sci. 95: 14863-14868, 1998). 

15 

EXAMPLE 3 

Identification of Additional Genes with Altered Expression in Ovarian Cancer 

20 This exan[5)le provides a description of how additional disclosed ovarian cancer-related 

nucleic acid molecules were identified These ovarian cancer-related molecules show differences in 
expression in subjects having ovarian cancer con:q)ared to expression in normal ovarian sur&ce 
epithelial cells. 

25 Using a different microarrays'and methods essentially similar to those desmbed above in 

Exan^le 1, thirty-one ovarian epithelial cancers were conq)ared to two normal postmenopausal 
ovarian samples. 141 additional ovarian cancer-related nucleic acid molecules were identified and 
further characterized (Tables 6 and 7, Addendum). 

30 Methods and Materials: 

Methods and materials were similar to those described in Example 1, except that different 
microarrays were used. The nucleic acids constituted 7,600 features, and representing different (non- 
redundant) transcripts including multiple known named genes and ESTs. The cDNA microarrays 
were constructed by Dr. Eric Chuang (Division of Radiation Oncology) at the Advanced Technology 

35 Center (Gaithersburg, MD 20877). The genes represented on these arrays are composed of 7,600 
cDNA clones and ESTs and are commercially available (Research Genetics, 2130 Memorial 
Parkway, HuntsviUle, AL 35801, U.S). 

The nucleic acid molecule ejq)ression patterns of thirty-one ovarian epithelial cancers were 
conq)ared to two normal postmenopausal ovarian saniples. The tissues were analyzed once, as the 
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coirelation coefficient from previously repeated array experiments was shown to be 0.92-0.95. Each 
tumor and normal sample was directly con^ared to a '^reference RNA" consisting of a mix of nine 
different hmnan cell lines (Stratagene, La JoUa, CA), allowing for indirect con5)arison of gene 
expression in tumors and normal ovarian san:Q)les. 
5 Hierarchcal clustering was performed as described above and as set forth in Eisen et al, 

Proc. Natl Acad, Set U.S,A. 95: 14863-8, 1998. 

Results 

Systematically Altered Genes 

10 Using these methods, two additional sets of nucleic acid molecules were identified that 

showed differential expression in subjects having ovarian cancer. Table 4 (see Addendum^provides 
a list of nucleic acid molecules lhat were found to be underexpressed in subjects having ovarian 
cancer, and their average gene log e:q)ression ratios. Table 5 (see Addendum) shows nucleic acid 
molecules that were found to be overexpressed in persons having ovarian cancer, and their average 

15 gene log e^qpression ratios. 

Genes undere>q)ressed in ovarian tumors (see Table 4) may represent potential tumor 
suppressors. The induction of the expression of these genes through therapeutic means, for instance 
by induction through drug or gene therapy, may slow tumor growth and/or increase tumor cell death. 
Among the 100 underexpressed genes were several oncogenes coding for proteins that are 

20 normally associated with the process of malignant transformation, 8ncluding yLW(SEQ ED NO: 137- 
138;, Krr(SEQ id NO: 298-299), and MAF (SEQ ID NO: 229-230). The lower expression of these 
genes in cancers con^ared to e?q)ression in non-cancerous subjects is unexpected and is believed to 
reflect novel effects unique to ovarian cancer. Additionally, CDKNl C (SEQ ID NO: 249), NBLl 
(SEQ ID NO: 273), and INGIL (SEQ ID NO: 322) are recognized tumor si^ressors, the 

25 dowhregulation of which may be involved in the process of tumor formation and/or progriessiori. 
TGF beta cascade members TGFBR3 (SEQ ID NO: 216-218) and EBAF (SEQ ID NO: 294) (both 
shown herem to be underexpressed in ovarian cancer) present potential interest in ligjit of the recent 
implication of the TGF beta pathway in norarnl and oral contraceptive-induced ovarian epithelial cell 
death and turnover (see Rodriguez et a/., J. Natl Cam Inst, 94(1): 50-60, 2002). Thus, 

30 downregulation of these nucleic acids may lead to inappropriate growth and possible transformation. 

Genes that wer6 overexpressed in ovarian tumors (Table 5) compared to normal tissue are 
believed to represent suitable targets for therapy and/or diagnosis, prognosis and staging of ovarian 
cancer. The decrease of the e^qjression of diese genes through therapeutic means, for instance by 
drug or gene therapy, presents a potential method of inhibition of ovarian cancer. 

35 Among the fifty-nine overcTqpressed gwies were several genes coding for proteins that are 

believed to be particularly promising as gene targets, including the following: iSLP/ (Secretory 
leukocyte protease inhibitor) (SEQ ID NO: 340-341); SPPl (Secreted phosphoprotein 1) (SEQ ID 
NO: 342); CKSl (CDC28 protein kinase 1) (SEQ ID NO: 345-347); ZFWAT(ZW10 interactor) (SEQ 
ID NO: 354); BF (B-factor, properdin) (SEQ ID NO: 343-344); MMP7 (Matrix metalloproteinase 7) 
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(SEQ ID NO: 348-349); FOLRl (Folate receptor 1) (SEQ ID NO: 364-365); KLK8 (Kallikrein 8) 
(SEQ ID NO: 368; CRIPl (Cysteine-rich proteia 1) (SEQ ID NO: 375; EYA2 (Eyes absent) (SEQ ID 
NO: 392-393); and PAX8 (Paired box gene 8) (SEQ ID NO: 350-351), 

SLPI is a particularly promising candidate as a potential ovarian cancer marker or detector. 
5 This protein has also been shown to be overexpressed in lung cancer (see Ameshima et al^ Cancer 
89(7): 1448-1456, 2000) and is detectable in Hie saliva, enabling non-invasive testing (see Sbugars et 
aLy Gerontology, 47(5): 246-253, 2001). MMP7 over-expression has been described in x>rimaiy and 
metastatic gastric cancers (see Mori et cd.. Surgery, 131(1 Pt 2): S39-S47, 2002) as well as colorectal 
carcinomas (see Ougolkov etah. Gastroenterology. 122(1): 60-71, 2002). MMP7 appears to be 

10 involved in new blood vessel formation, which is a prerequisite for tumor growth (see Nishizuka et' 
ah. Cancer Lett. 173(2): 175-182, 2001). SPPl (otherwise known as osteopondin) has also been 
associated with a number of malignancies (see Fedarko et ah, Clin, Cancer Res. 12: 4060-4066, 
2001) including a recent report showing higher e3q)ression in ovarian cancer (see Mok et aL, J, Natl 
Cancer Inst. 93(19) 1458-64, 2001). ZWINT is a newly discovered protein involved in kioetochore 

15 binding and centromere function (see Starr et al, J. Cell Sci. 1 13(Pt 11): 1939-1950, 2000). 

Properdin is involved in immune function and encodes complement factor B, a con^onent of the 
alternative pathway of complement activation. CRIPl is believed to be involved in zinc transport. 
Kallikrein.8 (also TADG14) is normally expressed in neural tissue, but appears to be altered such that 
it is highly eTcpressed in ovarian cancers (see Underwood et aL, Cancer Res. 59(17): 4435-4439, 

20 1999). EYA2, named for its involvement in eye development, is an ixnpoTtaaA developmental gene 
that is potentially inaportant in ovarian cancer. EYA2 is located on the 20ql3 chronoosomal locus, 
which is the most frequently amplified chromosome region in ovarian cancers (see Tanner et aL, 
Clin. Cancer Res. 5: 1833-1839, 2000). Other genes localized to ibst same 20ql3 diromosomal 
region are BMP7, which is also involved in development, and SLPI (discussed above), as well as HE4 

25 (identified in Exainple 1, above), all of which show higher expression in ovarian tumors. Thus; the * 
upregulation of these nucleic acids may in part be due to aioplification of 20ql3 in the tumors 
studied. 

PAX8 is involved in thyroid differentiation and normal function (see Pasca et aL, Proc. Natl. 
Acad, Sci, U.S.A, 97(24): 13144-13149, 2000). Furthermore, the folate receptor has been shown to be 

30 overejq)ressed in ovarian cancer (see Hough et al.. Cancer Res. 61(10): 3869-3876, 2001 and Bagnoli 
et aL, Oncogene, 19(41): 4754-4763, 2000). Finally, the specific pattern of caveolin (C4 VI) under- 
expression and Folate receptor {FOLRl) over-egression disclosed herein (see Tables 4 & 5, 
Addendum) is consistent with the reciprocal regulation of the expression of genes in ovarian cancer 
(see Bagnoli et aL, Oncogene, 19(41): 4754-4763, 2000). 

35 For each of the above specifically enmnerated genes, a survey of the Serial Analysis of Gene 

E^ession (SAGE) database (available through die UniGene search engine on the National Center 
for Biotechnology Information website) revealed that the e3q)ression of these genes is linuted to a 
relatively small number of tissues, including ovarian cancers and some other tumors, for instance 
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pancreatic or breast In addition, SLPI and SPPl are secreted proteins that may be detectable as a 
diagnostic marker in the serum of a subject 
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EXAMPLE 4 

Classification of a Tumor into BRCAl Ainkdd or jRftC12-linked tumor class. 

This example describes how to classify a tumor into a BRCAl-Uke or BRCA2'Tike tumor 
5 type using compoimd covariate prediction analysis. 

Class prediction can be perfonned using a Con[q)ound Covariate Predictor tDol, available as 
part of the BRB Array Tools software provided for download on the National Cancer Institute 
Intemet website. Detailed information about flie Compound Covariate Predictor is provided by the 
10 Biometric Research Branch, National Cancer Institute and can be found in the following technical 
reports listed at that site** McShane et al,, **Meaiods for assessing reproducibility of clustering 
patterns observed in analyses of nricroairay data" and Radmacher et a/., "A paradigm for class 
prediction using gene expression profiles." 

The con5>ound covariate predictor tool creates a multivariate predictor for one of two 
15 classes for each sample using markers in the multivariate predictor that are univariately significant at 
the selected significance cutoff for a given set of data (see discussion above in Section K D, 
"Compound Covariate Predictor Analysis:'). The statistical significance cutoff for a given set of 
data can be chosen based upon the level of confidence desired. 

By way of exanq)le, the markers provided in Table 10 satisfy a cutoff of P<0.0005, and are 
20 therefore suitable for use with conapound covariate predictor analysis. The multivariate predictor is a 
weighted linear combination of log-ratios for genes that are univariately significant The weight 
consists of the univariate t-statistics for conq)aring the classes. 

Using the compound covariate predictor and the markers provided in Table 10, a san^>le of 
ovarian tissue can be classified into a SJ?C4i-like or BRCA2ASkst tumor. Samples are prepared as 
25 described in Exaiaple 1, and logarithmic expression ratios obtained for each marker used in the " 
conq)ound covariate predictor analysis. 

The markers provided in Table 10 were used to segregate BRCAJ-liaksd md BRCAI-type 
sporadic tumor samples from 5i?C42-hnked and BRCA2-type sporadic samples, in a multivariate 
analysis. Based upon the information regarding tiiese classes that was obtained using other 
30 approaches (such as hierarchical clustering, see Example 1), conq)ound covariate predictor analysis 
classified the tumors with 92% accuracy (see Table 1 1). 

Using this method, an unknown tumor can be classified into one of any two groups provided 
fliat markers that are univariately significant at the selected significance cutoff for the desired groups 
are known. Li addition, flie gene expression data for the markers should be obtained using the same 
35 reference standard as the sairple tumor. 

Further analysis, such as a "leave-one-out" approach may be eu5)loyed to check the 
veracity of the conq)ound covariate predictor model. In this approach, each of the tumors is 
individually segregated, and the analysis conipleted using that tumor against the remaining sanq>les. 
in this way, the strength of the data set is measured against each individual saitiple (tumor), 
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confinning that the data set is useftil, independently of any individual sample. See Radnoacher et al^ 
"A paradigm for class prediction using gene expression profiles," available on the Biometric 
Research Branch, National Cancer Institute Internet site. 

5 EXAMPLES 

Expression of Ovarian Cancer-related Polypeptides 

This exanq)le describes how to e?qpress the ovarian cancer-related proteins disclosed heiein 
using various techniques. 

10 

The disclosed ovarian cancer-related proteins (and fragments thereof) can be expressed by 
standard laboratory technique. After expression, the purified ovarian cancer-related protein or 
polypeptide may be used for instance for fiinctional analyses, antibody production, diagnostics, 
prognostics, and patient therapy, eg., for prevention or treatment of ovarian cancer. Furthermore, the 

15 DNA sequences encoding the disclosed ovarian cancer-related proteins can be manipulated in studies 
to understand the expression of these genes and the function of their products. Mutant forms of 
human ovarian cancer-related proteins (and corresponding encoding sequences) may be isolated 
based upon information contained herein, and may be studied in order to detect alteration in 
expression patterns in terms of relative quantities, tissue specificity and functional properties of the 

20 encoded mutant ovarian cancer-related protein. Partial or full-length cDNA sequences fliat encode 
the subject protein may be ligated into bacterial expression vectors. Methods for 6?qpressing large 
amounts of protein from a cloned gene introduced into Escherichia coli (£. colij or other prokaryotes 
may be utilized for the purification, localization, and functional analysis of proteins. For exaniple, 
fusion proteins consisting of anoino terminal peptides encoded by a portion of the E. coli lacZot trpE 

25 gene linked to an ovarian cancer-related protein may be used to prepare polyclonal and monoclonal' 
• antibodies against these proteins. Thereafter, these antibodies may be used to purify proteins by 
immunoaffinity chronaatography, in diagnostic assays to quantitate the levels of protein and to 
localize proteins in tissues and individual cells by inmiunofluorescence. 

Intact native protein may also be produced in E. coli in large amounts for functional studies. 

30 Methods and plasmid vectors for producing fusion proteins and intact native proteins in bacteria are 
described in Sambrook et al {hi Molecular Cloning: A Laboratory Manual, Ch. 17, CSHL, New 
York, 1 989). Such fusion proteins may be made in large amounts, are easy to purify, and can be used 
to elicit antibody response. Native proteins can be produced in bacteria by placing a strong, regulated 
promoter and an efficient ribosome-binding site iqpstreamof the cloned gene. If low levels of protein 

35 are produced, additional steps may be taken to intnrease protein production; if hig|h levels of protein 
are produced, purification is relatively easy. Suitable methods are presented in Sambrook et al (In 
Molecular Cloning: A Laboratory Manual, CSHL, New York, 1989) and are well known in the art. 
Often, proteins expressed at high levels are found in insoluble inclusion bodies. Methods for 
extracting proteins from these aggregates are described by Samibrook et al (In Molecular Cloning: A 
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Laboratory Manual, Ch. 17. CSHL, New York, 1989). Vector systems suitable for the expression of 
lacZ fusion genes include the pUR series of vectors (see Ruther and Muller-Hill, EMBO / 2: 1791, 
1983), pEXl-3 (see Stanley and Luzio, EMBO J. 3:1429, 1984) andpMRlOO (see Gray et al, Proc. 
Natl Acad. Set USA 79:6598, 1982). Vectors suitable for the production of intact native proteins 
5 include pKC30 (see Shimatake and Rosenberg, Nature 292:128, 1981), pKK1177-3 (see Amann and 
Brosius,Ge7ic40:183, 1985)andpET-3(seeStudiarandMof&tt /.Moi^/o/. 189:113, 1986). 
Fusion proteins, for instance fusions that incorporate a portion of an ovarian cancer-related protein, 
may be isolated from protein gels, lyophiUzed, ground into a powder and used as an antigen. The 
DNA sequence can also be transferred from its existing context to other cloning vehicles, such as 

1 0 other plasmids, bacteriophages, cosmids, animal viruses and yeast artificial chromosomes (YACs) 
(see Burke et al. Science 236:806-812, 1987). These vectors may then be introduced into a variety 
of hosts including sonoatic cells, and simple or complex organisms, such as bacteria, fungi (see ' 
Timberlake and Marshall, Science 244:1313-1317, 1989), invertebrates, plants (see Gasser and 
Fraley, Science 244:1293, 1989), and animals (see Pursel et al, Science 244:1281-1288, 1989), vMch 

IS cell or organisms are rendered transgenic by the introduction of the heterologous ovarian cancer- 
related cDNA. 

For expression in mammalian cells, the cDNA sequence may be ligated to heterologous 
promoters, such as the simian vims (SV) 40 promoter in the pSV2 vector (see Mulligan and Berg, 
Proa Natl Acad. Set USA l^\2{fll'lQ16, 1981), and introduced into cells, such as monkey COS-1 

20 cells (see Gluzmai^ Cell 23:175-182, 1981), to achieve transient or long-term expression. The stable 
integration of the chimeric gene construct may be maintained in tnammallgn cells by biochemical 
selection, for exan^le with neomycin (see Southern and Berg, J. Mol Appl Genet. 1: 327-341, 1982) 
or mycophenolic acid (see Mulligan and Berg, Proc. Natl Acad, ScL USA 78: 2072-2076, 1981). 
DNA sequences can be manipulated with standard procedures such as restriction enzyme 

25 digestion; fOl-in with DNA polymerase, deletion by exonuclease, extension by teiin^ ' ~ 

deoxynucleotide transferase, Ugation of synthetic or cloned DNA sequences, site-directed sequ^ce- 
alteration via single-stranded bacteriophage intermediate or with the use of specific oligonucleotides 
in combination with PGR. 

The cDNA sequence (or portions derived firom it) or a mini gene (a cDNA with an intron 

30 and its own promoter) may be introduced into eukaryotic expression vectors by conventional 

techniques. These vectors are designed to permit the transcription of the cDNA in eukaryotic cells by 
providing regulatory sequences that initiate and enhance the transcription of the cDNA and ensure its 
proper splicing and polyadenylatioa Vectors containing the promoter and enhancer regions of the 
SV40 or long terminal repeat (LTR) of die Rous Sarcoma virus and polyadenylation and splicing 

35 signal &om SV40 are readily available (see Mulligan et al, Proc, Natl Acad, ScL USA 

78:1078-2076, 1981; Gorman etal, Proc. Natl Acad Sci USA 78:6777-6781, 1982). The level of 
expression of the cDNA can be manipulated with this type of vector, either by using promoters that 
have different activities (for exantple, the baculovirus pAC373 can e3q)ress cDNAs at high levels in 
S.fingiperda cells (see Summers and Smith, In Genetically Altered Viruses and the Emnrorment^ 
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Fields et al (Eds.) 22:3 19-328, CSHL Press, Cold Spring Harbor, New York, 1985) or by using 
vectors that contain promoters amenable to modulation, for exaiiq)le, the glucocorticoid-responsive 
promoter from the mouse mammary tumor virus (see Lee et al, Nature 294:228, 1982). The 
expression of the cDNA can be monitored in the recipient cells 24 to 72 hours after introduction 
5 (transient expression). 

In addition, sonoe vectors contain selectable markers such as the gpt (see Miilligan and Berg, 
Proc. Natl Acad. Sd, USA 78:2072-2076, 1981) or neo (see Southern and Berg, 7. MoL Appl Genet 
1:327-341, 1982) bacterial genes. These selectable markers permit selection of transfected cells that 
exhibit stable, long-term expression of the vectors (and Oerefore die cDNA). The vectors can be 

10 m ai nt a ine d in the cells as episomal, freely repUcating entities by using regulatory elements of viruses 
such as papilloma (see SarvereM/1, Mo/. CellBiol 1:486, 1981)orEpstein-Barr(seeSugdenera/., 
Mol Cell Biol 5:410, 1985). Alternatively, one can also produce cell lines that have integrated the 
vector uito genomic DNA. Both of these types of cell lines produce the gene product on a continuous 
basis. One can also produce cell lines that have anoplified the number of copies of the vector (and 

15 therefore of the cDNA as well) to create cell lines that can produce high levels of the gene product 
(see Alt et al, J, Biol Chem. 253:1357, 1978). 

The transfer of DNA into eukaryotic, in particular human or other mammalian cells, is now 
a conventional technique. The vectors are introduced into the recipient cells as pure DNA 
(transfection) by, for example, precipitation with calcium phosphate (see Graham and vander Eb, 

20 Virology 52:466, 1973) or strontium phosphate (see Brash et al„ Mol Cell Biol 7:2013, 1987), 

electroporation (see Neumann et al, EMBOJ 1:841, 1982), lipofection (see Feigner et al, Proc. Natl 
Acad Sci USA 84:7413, 1987), DEAE dextran (see McCuthan et al, 1 Natl Cancer Inst 41:351, 
1968), microinjection (see Mueller et al, Cell 15:579, 1978), protoplast fusion (see Schafiier, Proc. 
Nati. Acad. ScL USA 77:2163-2167, 1980), or pellet guns (see Klein et al., Nature 327:70, 1987). 

25 Altanatively, flie cDNA, or fragments thereof, can be introduced by infection with vinis vectois. 
Systems are developed that use, for example, retroviruses (see Bernstein et al. Gen. Engr'g 7:235, 
1985), adenoviruses (see Ahmad c/., J. Virol 57:267, 1986), or Herpes virus (see Spaete et al. Cell 
30:295, 1982). MBl encoding sequences can also be delivered to target cells in vitro via non- 
infectious systems, for instance liposomes. 

30 These eukaryotic expression systems can be used for studies of ovarian cancer-related 

nucleic acids (such as those listed in Table 1) and mutant forms of these molecules, as well as ovarian 
cancer-related proteins and mutant forms of these protein. Such uses include, for exang)le, the 
identification of regulatory elements located in the 5* region of ovarian cancer-related genes on 
genomic clones tiiat can be isolated from human genomic DNA libraries. The eukaryotic expression 

35 systems may also be used to study the function of the normal ovarian cancer-related proteins, specific 
portions of these proteins, or of naturally occurring or artificially produced mutant versions of 
ovarian cancer-related proteins. 

Using the above techniques, the expression vectors containing ovarian cancer-related gene 
sequence or cDNA, or fiagments or variants or mutants thereof can be mtroduced into human cells* 
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mammalian cells from other species or non-inammalian cells as desired. The choice of cell is 
determined by the purpose of the treatment. For example, monkey COS cells (see Gluzman, Cell 
23:175-182, 1981) that produce high levels of the SV40 T antigen and permit the replication of 
vectors containing the SV40 origin of replication maybe used. Similarly, Chmese hamster ovary 

5 (CHO), mouse NIH 3T3 fibroblasts or human fibroblasts or lyn9}hoblasts may be used 

The present disclosure thus enconpasses recombinant vectors that conptise all or part of an 
ovarian cancer-related gene or cDNA sequence (eg., those listed in Table 1), for expression in a 
suitable host In some embodimaits, the ovarian cancer-related nucleic acid sequence is operatively 
linked in the vector to an e}q>ression control sequence to form a recombinant DNA molecule, so that 

10 the ovarian cancer-related polypeptide can be expressed The expression control sequence may be 
selected from the groiq> consisting of sequences that control the expression of genes of prokaryotic or 
eukaryotic cells and their viruses, and combinations thereof. The expression control sequence may be 
specifically selected from the group consisting of the lac system, the trp system, the tac system, the 
trc system, major operator and promoter regions of phage lambda, the control region of fd coat 

1 5 protein, the early and late promoters of S V40, promoters derived from polyoma, adenovirus, 

retrovirus, baculovinis and simian virus, the promoter for 3-phosphoglycerate kinase, the promoters 
of yeast acid phosphatase, the promoter of the yeast alpha-mating factors, and combinations thereof 
The host cell, which may be transfected with the vector of this disclosure, may be selected 
&om the group consisting of £. cofi, PseudomonaSy Bacillus subtilis, B. stearothermophilus or other 

20 bacilli; other bacteria; yeast; fungi; insect; mouse or other animal; or plant hosts; or human tissue 
cells. 

It is appreciated that for mutant or variant ovarian cancer-related DNA sequences, similar 
systems are en^loyed to express and produce the mutant product. In addition, fragments of an 
ovarian cancer-related protein can be expressed essentially as detailed above. Such fragments include 
25 individual ovarian cancer-related protein domains or sub-domains, as well as shorter fragments such 
as peptides. Ovarian cancer-related protem fi:agments (e,g,^ those having therapeutic properties) may 
be e?q;)ressed in this manner also. 

EXAMPLE 6 

30 Suppression of Ovarian Cancer-related Increased Gene Expression 

This example describes how the ovarian cancer-related xmcleic acids disclosed herein may 
be suppressed using various techniques. 

35 A reduction of ovarian cancer-related protein e^qpression in a transgenic cell may be obtained 

by introducing into cells an antisense construct based on an ovarian cancer-related protein encoding 
sequence, such as a cDNA or gene sequence or flanking regions thereof of any one of the proteins 
encoded by the nucleic acid molecules listed in Table 1, Table 9 or elsewhere herein. For antisense 
suppression, a nucleotide sequoice encoding an ovarian cancer-related protein tiiat is overexpressed 
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in ovarian cancer, e.g. all or a portion of the small cell lung carcinoma cluster 4 antigen {CD24) (SEQ 
ID NO: 181-182), secretory leukocyte protease inhibitor antileukoproteinase {SLPI) (SEQ ID NO: 
340-341), secreted phosphoprotein 1 {SPPl) (SEQ ID NO: 342), B-factor, properdin {BF) (SEQ ID 
NO: 343-344), 'liomolog of Cksl=p34Cdc28/Cdc2-associated protein" {CKSl) (SEQ ID NO: 345- 
5 347), matrix metalloproteinase 7 {MMP7) (SEQ ID NO: 348-349), paired box gene 8 (PAX8) (SEQ 
ID NO: 350-351), serine protease inhibitor, Kunitz type, 2 (SPINT2) (SEQ ID NO: 352-353), ZWIO 
interactor (ZWINT) (SEQ ID NO: 354), diacylglycerol kinase (DGKH) (SEQ ID NO: 355), high- 
mobility ffoup (nonhistone chromosomal) protein isoforms I and Y {HMGTY) (SEQ ID NO: 356), 
Syndecan-4 - amphiglycan - ryudocan core protein (SDC4) (SEQ ID NO: 357-359), cyclin-dependent 

10 kinase inhibitor 2A (CDKN2A) (SEQ ID NO: 360), sodiwn channel, nonvoltag&-gated 1 alpha 

{SCNNJA) (SEQ ID NO: 361-362), lactate dehydrogenase A {LDHA) (SEQ ID NO: 363), adult folate 
receptor (FOLRI) (SEQ ID NO: 364-365), Triosephosphate isomerase 1 (TPIl) (SEQ ID NO: 366- 
367), kallikrein 8 (neuropsin/ovasin) (KLKS) (SEQ ID NO: 368), CXC chemokine receptor 4- fiisin- 
neuropeptide Y receptor-L3 (CXCR4) (SEQ ID NO: 200), kinesin-like 1 (KNSLI) (SEQ ID NO: 369- 

15 370), H2A histone family, member O {H2AFO) (SEQ ID NO: 37 1-372), major histocompatibility 
con5>lex, class II, DR alpha, HLA-DRA, cysteine-rich protein 1 (intestinal) (CRIPI) (SEQ ID NO: 
375), pyrophosphatase (inorganic), (PP) (SEQ ED NO: 376), EST 666391, glucose transporter 
(HepG2) (SLC2A1) (SEQ ID NO: 379-381), EST 897770, hepatoma-derived growth factor {HDGF) 
(SEQ ED NO: 383-385), argininosuccinate synthetase {ASS) (SEQ ID NO: 386), claudin4 (CXZ)J\AO 

20 (SEQ ID NO: 387-388), preferentially expressed antigen in melanoma {PRAME) (SEQ ID NO: 389), 
LAR= LCA-homologue (PTPRF) (SEQ ID NO: 390-391), eyes absent (Drosophila) homolog 2 
{EYA2) (SEQ ID NO: 392-393), L-myc (MYCLJ) (SEQ ID NO: 394-396), STATl-IFN alpha^eta- 
lesponsive transcription factor ISGF3 beta subunits (p91/p84) (STATl) (SEQ ID NO: 397-399), 
mitochondrial carrier homolog 2 (MTCH2) (SEQ ID NO: 400-401), 5-hydroxyttyptamine (serotonin) 

25 receptor 3A (HTRSA) (SEQ ID NO: 402), cyclinEl (CCNEl) (SEQ ID NO: 403-404), cadheriri 6, 
type 2, K-cadherin (fetal kidney) {CDH6) (SEQ ID NO: 405), 5*-AMP-activated protein kinase - 
gamma-1 subunit (PRKAGl) (SEQ ID NO: 406-408), defensinbeta 1 (DEFBl) (SEQ ID NO: 409), 
actin related protein 2/3 complex, subunit 1 A (41 kD) (ARPCIB) (SEQ ID NO: 410-41 1), PKC 
iota=Protein kinase C, iota (PRKCI) (SEQ ID NO: 412-414), glyceraldehyde-3-phosphate 

30 dehydrogenase {GAPD) (SEQ ID NO: 415), complement component 2 (C2) (SEQ ID NO: 416-417), 
H2A histone family, member Y {H2AFY) (SEQ ID NO: 418-419), transmembrane 4 superfamily 
member 1 {TM4SF1) (SEQ ID NO: 420-421), gIyceraldehyde-3-phosphate dehydrogenase {GAPD) 
(SEQ ID NO: 422-423), Interferon-inducible protein 1-8U (IFITM3) (SEQ ID NO: 424-426), glycine 
dehydrogenase (decarboxylating; glycine decarboxylase, glycine cleavage system protein?) (GLDQ 

35 (SEQ ID NO: 427-428), calumenin (CALU) (SEQ ID NO: 429-430), hemoglobin alpha 2 (HBA2) 
(SEQ ED NO: 431-432), SlOO calcium-binding protein All (calgizzarin) (SlOOAll) (SEQ ID NO: 
433), Lactate dehydrogenase A (LDHA) (SEQ ID NO: 434-436), ubiquitin-conjugating enzyme E2C 
(UBE2Q (SEQ ED NO: 437), E2F-3=pRB-binding transcription factoi=KIAA0075 {E2F3) (SEQ ID 
NO: 438-440), E-cadherin (CDHJ) (SEQ ID NO: 441-442), proteasome (prosome, macropain) 
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activator subunit 2 (PA28 beta) {PSME2) (SEQ ID NO: 443-444), OP-l=osteogeiiic protein in the 
TGF-beta fanuly {BMP?) (SEQ ID NO: 445-447), and topoisomerase H (JOP2A) (SEQ ID NO: 448) 
cDNA or gene, is arranged in reverse orientation relative to the promoter sequence in the 
transformation vector. Other aspects of the vector may be chosen as for any other expression vector 
5 {see, Exanople 4). 

The introduced sequence need not be a fidl-leo^ human ovarian cancer-related cDNA or 
gene, and need not be exactly homologous to the equivalent sequence found in the cell type to be 
transformed. Generally, however, where the introduced sequence is of shorter length, a higiher degree 
of homology to the ovaiian cancer-related sequence likely will be needed for effective antisense 
10 suppression The introduced antisense sequence in the vector may be at least thirty nucleotides in 
length, and improved antisense suppression will typically be observed as the length of the antisense 
sequence increases. The length of the antisense sequence in the vector advantageously may be 
greater than 1 00 nucleotides. 

Although the exact mechanism by which antisense RNA molecules interfere with gene 
15 expression has not been elucidated, it is believed that antisense RNA molecules bind to the 
endogenous mKNA molecules and thereby inhibit translation of the endogenous mRNA. 

Suppression of endogenous ovarian cancer-related gene expression can also be achieved 
using ribozymes. Ribozymes are synthetic RNA molecules that possess highly specific 
endoribonuclease activity. The production and use of libozymes are disclosed in U.S. Patent No. 
20 4,987,07 1 to Cech and U.S. Patent No. 5,543,508 to Haselhoff. The inclusion of ribozyme sisquences 
within antisense RNAs may be used to confer RNA cleaving activity on the antisense RNA, such that 
endogenous mRNA molecules that bind to the antisense RNA are cleaved, which in turn leads to an 
enhanced antisense inhibition of endogenous gene e}q;>ression. 

In addition, dominant negative mutant forms of the disclosed ovarian cancer-related sequences 
25 may be used to block endogenous activity oftbe corresponding gene products^ 

Suppression can also be achieved using small inhibitory RNA molecules (siRNAs) (see, for 
instance, Caplen et al, Proc. Natl Acad. Set 98(17): 9742-9747, 2001, and Elbashir et al. Nature 
41 1: 494-498, 2001). Thus, this disclosure also enconcpasses siRNAs that correspond to an ovarian 
cancer-related nucleic acid, which siRNA is capable of suppressing the expression or function of its 
30 cognate (target) ovarian cancer-related protein. Also encompassed are methods of suppressing the 
expression or activity of an ovarian cancer-related molecule using an siRNA. 

Suppression of expression of an ovarian cancer-related gene can be used, for instance, to treat, 
reduce, or prevent cell proliferative and other disorders caused by over-expression or unregulated 
expression of ttie corresponding ovarian cancer-related gene. Inparticolai, suppression of expression 
35 of sequences disclosed hexebi as being up-regulated in ovarian cancer can be used to treat, reduce, or 
prevent progression to a later stage of ovarian cancer. 
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EXAMPLE? 
Nucleic Acid-Based Analysis 

This exanq)le describes how to use the ovarian cancer-related nucleic acids disclosed herein 
S to detect and analyze neoplasms and mutations in ovarian cancer-related nucleic acids that may result 
in neoplasms. 

The ovarian cancer-related nucleic acid n:K>lecules provided herein, and combinations of 
these molecules, can be used in methods of genetic testing for neoplasms (e.g., ovarian or other 

10 cancers) or predisposition to neoplasms owing to altered eiqpression of ovarian cancer-related nucleic 
acid molecules {e.g., deletion, genomic an:iplification or mutation, or ov^- or under-expression in 
comparison to a control or baseline). For such procedures, a biological sanq>le of the subject, which 
biological sample contains either DNA or RN A derived from the subject is assayed for a mutated, 
anq)lified or deleted ovarian cancer-related nucleic acid molecule, or for over- or under-expression of 

15 an ovarian cancer-related nucleic acid molecule. Suitable biological sanq>le5 include samples 

containing genomic DNA or RNA (including rtiRNA), obtained from body cells of a subject, such as 
those present in peripheral blood, urine, sahva, tissue biopsy, surgical specimen, amniocentesis 
sanG^les and autopsy material. 

The detection in the biological sanq}le of a mutant ovarian cancer-related nucleic acid 

20 molecule, a mutant ovarian cancer-related RNA, an ancplified or homozygously or heterozygously 
deleted ovarian cancer-related nucleic acid molecule, or over- or under-expression of an ovarian 
cancer-related nucleic acid molecule, may be performed by a number of methodologies, exan:q)les of 
which are provided. 

25 A. Detection of Unknown Mutations: 

Unknown mutations in ovarian cancer-related nucleic acid molecules can be identified 
through polymerase chain reaction anplification of reverse transcribed RNA (RT-PCR) or DNA 

isolated from breast or ovary or other tissue, followed by direct DNA sequence determination of the 
products; single-strand conformational polymorphism analysis (SSCP) (for instance, see Hongyo et 
30 a/.. Nucleic Acids Res, 21 :3637-3642, 1993); chemical cleavage (including HOT cleavage) (Bateman 
et al. Am, J, Med. Genet, 45;233-240, 1993; reviewed in Ellis et ah. Hum. Mutat 11:345-353, 1998); 
denaturing gradient gel electrophoresis (DGGE); ligation amplification mismatch protection 
(LAMP); or enzymatic mutation scanning (Taylor and Deeble, Genet Anal 14:181-186, 1999), 
followed by direct sequencing of amplicons with putative sequence variations. 

35 

B. Detection of Known Mutations: 

The detection of specific known DNA mutations in ovarian cancer-related nucleic acid 
molecules may be achieved by methods such as hybridization using allele q)ecific oligonucleotides 
(ASOs) (see Wallace et al, CSHL Symp. Quant. Biol 51:257-261, 1986), direct DNA sequencing 
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(see Church and Gilbert Proc. Natl Acad, Sci. USA 81:1991-1995, 1988), the use of restriction 
enzymes (see Flavell et al, Cell 15:25, 1978; Geever et al, Proc, Natl Acad, Sci. USA, 8(8): 5081- 
5085, 1981), discrimination on the basis of electrophoretic mobility in gels with denaturing reagent 
(see Myers and Maniatis, Cold Spring Harbor iSy/zi;?. Quant Biol 51:275-284, 1986), RNase 
5 protection (see Myers et al, Science 230:1242, 1985), chemical cleavage (see Cotton et al, Proc. 
Natl Acad, Set USA 85:4397-4401, 1985), and the Ugase-mediated detection procedure (see 
Landegren et al. Science 241:1077-1080, 1988). OUgonucleotides specific to normal or maxtant MBl 
sequences can be chemically synthesized using commercially available machines. These 
oligonucleotides can then be labeled radioactively mfh isotopes (such as ^P) or non-radioactively, 

10 with tags such as biotin (see Ward and Langer et al, Proc. Natl Acad. Sci. USA 78:6633-6657, 

198 1), and hybridized to individual DNA sanq)les immobilized on menihranes or oflier solid supports 
by dot-blot or transfer from gels after electrophoresis. These specific sequences are visualized by 
methods such as autoradiography or fluorometric (see Landegren et al. Science 242:229-237, 1989) 
or colorimetric reactions (see Gebeyehu et al. Nucleic Acids Res. 15:4513-4534, 1987). Using an 

15 ASO specific for a normal allele, the absence of hybridization would indicate a mutation in the 

particular region of the gene, or deleted MBl gene. In contrast, if an ASO specific for a mutant allele 
hybridizes to a clinical sample then that would indicate the presence of a mutation in the region 
defined by the ASO. 

^0 C. Detection of Genomic Amplificatioii or Deletion 

Gene dosage (copy number) can be in5)ortant in neoplasms; it is therefore advantageous to 
determine the number of copies of ovarian cancer-related nucleic acids in biological sanq}les of a 
subject, e.g., serum or ovary samples. Probes generated firom flie disclosed encodii]^ sequence of in 
ovarian cancer-related nucleic acid molecules can be used to mvestigate and measure genomic dosage 

25 ofthe corresponding ovarian cancer-related genomic sequence. " . . . 

Appropriate techniques for measuring gene dosage are known in the art; see for instance, US 
Patent No. 5,569,753 ("Cancer Detection Probes") and Pinkel et al {Nat Genet. 20:207-21 1, 1998) 
("High Resolution Analysis of DNA Copy Number Variation using Comparative Genomic 
Hybridization to Microarrays"). 

30 Determination of gene copy number in cells of a patient-derived sample using other 

techniques is known in the art For example, amplification of an ovarian cancer-related nucleic acid 
sequence in cancer-derived cell lines as well as uncultured ovarian cancer or other cells can be carried 
out using bicolor HSH analysis. By way of example, interphase FISH analysis of breast cancer cell 
lines can be carried out as previously described (see Barlund et al. Genes Chroma. Cancer 20:372- 

35 376, 1997). The hybridizations can be evaluated using a Zeiss fluorescence microscope. 

For tissue microarrays, the FISH can be performed as described in Kononen et al (Nat 
Med. 4:844-847, 1998). Briefly, consecutive sections ofthe array are deparafSnized, dehydrated in 
eflianol, denatured at 74* C for 5 minutes in 70% formamide/2 x SSC, and hybridized with test and 
reference probes. The specunens containing tight clusters of signals or >3-fold increase in the 
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number of test probe as conq)ared to chromosome 17 centromere in at least 10% of the tumor cells 
may be considered as an^lified. Microarrays can be constructed as described in WO 99/44063 A2 
andWO99/44062Al. 

5 C. Detection of mRNA Expression Levels 

Altered expression of an ovarian cancer-related molecule also can be detected by measuring 
tiie cellular level of ovarian cancer-related nucleic acid molecule-specific niRNA. mRNA can be 
measured using techniques well known in the art, including for instance Northern analysis, RT-PCR 
and mRNA in situ hybridization. Details of mRNA analysis procedures can be found, for instance, in 
10 Exmple 1, Exsaxxple 3, and Sambrook et al. (ed), Molecular Qoning: A Laboratory Manual, 2nd 
ed., vol. 1-3, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

The nucleic acid-based diagnostic methods of this disclosure are predictive of ovarian 
cancer. Cells of any tumors that demonstrate alt^ed egression levels (eg., through genomic 
amplification, deletion, mutation, or other over- or under-expression) of nucleotide sequences that 
15 share homology with the ovarian cancer-related nucleic acids disclosed herein are aggressive tumor 
cells, and result in decreased survival, increased metastasis, increased rates of clinical, and overall 
worsened prognosis. 

EXAMPLE 8 

20 Production of Protein Specific Binding Agents 

This example describes how to use the ovarian cancer-related molecules disclosed herein to 
produce binding agents useful in preventing ovarian cancer. 

25 Monoclonal or jwlyclonal antibodies may be produced to any of the disclosed ovarian 

cancer-related proteins, or mutant forms of fliese proteins. Optimally, antibodies raised against these 
proteins, or peptides fi:om within such proteins, would specifically detect the protein or peptide wifii 
which the antibodies are generated. That is, an antibody generated to the BMP7 protein or another 
specified protein (see Table 1) or a fi-agment thereof would recognize and bind that protein and would 

30 not substantially recognize or bind to other proteins found in human cells. 

The determination that an antibody specifically detects a designated protein {e.g. , an ovarian 
cancer-related protein as disclosed herein) can be made by any one of a number of standard 
immunoassay methods; for instance, the Westem blotting technique (see Sambrook et al, In 
Molecular Cloning: A Laboratory Manual, CSlIL,l<fev/ York, 1989). To determine that a given 

35 antibody preparation (such as one produced in a mouse) specifically detects a designated protein by 
Westem blotting, total cellular proteins are extracted firom cells (for cxmplo, human ovary) and 
electrophoresed on a sodium dodecyl sul&te-polyacrylamide gel. The proteins are then transferred to 
a membrane (for exan5)le, nitrocellulose) by Westem blotting, and the antibody preparation is 
incubated with the membrane. After washing the membrane to remove non-specifically bound 
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antibodies, the presence of speciJScally bound antibodies is detected by the use of an anti-mouse 
antibody conjugated to an enzyme such as alkaline phosphatase. AppHcation of an alkalme 
phosphatase substrate 5-bromo-4-chloro-3-indolyl phosphate/nitro blue tetrazolium results in the 
production of a dense blue conq)oimd by immunolocalized alkaline phosphatase. Antibodies that 
5 specifically detect the designated protein will, by this technique, be shown to bind to the designated 
protein band (which will be localized at a given position on the gel determined by its molecular 
weight). Non-specific binding of the antibody to other proteins may occur and may be detectable as a 
weak signal on the Western blot The non-specific nature of fliis biziding will be recognized by one 
skilled in the art by the weak signal obtained on &e Western blot relative to the strong primary signal 

10 arising £rom the specific antibody-protein binding. 

Substantially pure ovarian cancer-related protein or protein fiiagment (peptide) suitable for 
use as an immunogen may be isolated from transfected or transformed cells, as described above. 
Concentration of protein or peptide in the final preparation is adjusted, for example, by concentration 
on an Amicon filter device, to the level of a few micrograms per milliliter. Monoclonal or polyclonal 

1 5 antibody to the protein can then be prepared as follows: 

A. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of a designated protein (such as an ovarian cancer-related 
protein, including one encoded by a nucleic acid Hsted in Table 1) identified and isolated as described 

20 can be prepared firom murine hybridomas according to the classical method of Kohl^ and Milstein 
{Nature 256:495-497, 1975) or derivative methods thereof. Briefly, a mouse is repetitively 
inoculated with a few micrograms of the selected protein over a period of a few weeks. The mouse is 
then sacrificed, and tiie anfibody-producing cells of the spleen isolated The spleen cells are fiised by 
means of polyethylene glycol with mouse myeloma cells, and the excess un-fiised cells destroyed by 

25 growth of the system on selective media conq)rising aminopterin (HAT media). The successfully 
fiised cells are diluted and aliquots of the dilution placed in wells of a microliter plate where growth 
of fibe culture is continued. Antibody-producing clones are identified by detection of antibody in the 
supernatant fluid of the wells by immunoassay procedures, such as ELISA, as originally described by 
Engvall {Meth. Enzymoi 70: 419-439, 1980), and derivative methods thereof. Selected positive 

30 clones can be expanded and their monoclonal antibody product harvested for use. Detailed 

procedures for monoclonal antibody production are described in Harlow and Lane (Antibodies, A 
Laboratory Manual, CSHL, New York, 1988). 

B. Polyclonal Antibody Production by Immunization 

35 Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein can 

be prepared by immunizing suitable animals wifli the expressed protein, which can be unmodified or 
modified to enhance immunogenicity. Effective polyclonal antibody production is affected by many 
factors related both to the antigen and the host species. For exan^le, small molecules tend to be less 
immunogenic flian others and may require the use of carriers and adjuvant. Also, host animals vary 
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in response to site of inoculations and dose, with either inadequate or excessive doses of antigen 
resulting in low titer antisera. SmaU doses (ng level) of antigen administered at multiple intradermal 
sites appear to be most reliable. An effective immunization protocol for rabbits can be found in 
Vaitukaitis etal. (J. Clin. Endocrinol Metab. 33: 988-991, 1971). 
5 Booster injections can be given at regular intervals, and antiserum harvested when antibody 

titer thereof, as determined semi-quantitatively, for example, by double immunodifiiision in agar 
against known concentrations of the antigen, begins to faU. See, for exan5)le, Ouchterlony et al (In 
Handbook of Experimental Immunology, Wist {e±) Chapter 19. Blackwell, 1973). Plateau 
concentration of antibody is usually in the range of about 0.1 to 02 mg/ml of serum (about 12 |jM). 
10 Affinity of the antisera for the antigen is determmed by preparing con?)etitive binding curves, as 
described, for exanqjle, by Fisher (Manual of Clinical Immunology, Ol 42, 1980). 

C- Antibodies Raised against Synthetic Peptides 

A third approach to raising antibodies against the subject oyarian cancer-related proteins or 
15 peptides is to use one or more synthetic peptides synthesized on a commercially available peptide 
synthesizer based upon the predicted amino acid sequence of die desired ovarian cancer-related 
protein or peptide. 



D. Antibodies Raised by Injection of Protein Encoding Sequence 

20 Antibodies also may be raised against proteins and peptides related to ovarian cancer as 

described herein by subcutaneous injection of a DNA vector that expresses the desired ovarian 
cancer-related protein, or a fragment thereof into laboratory animals, such as mice. DeUveiy of the 
recombinant vector into the animals may be achieved usmg a hand-held form of the Biolistic system 
(see Sanford et al, Particulate Set Technol Sai-il, 1987) as described by Tang et al {Nature 

25 . 356:152-154, 1992). Expression vectors suitable for this purpdse ttiay include those that eiq)ress the 
ovarian cancer-related sequence under the transcriptional control of either the human P-actin 
promoter or the cytomegalovirus (CMV) promoter. 

Antibody preparations prepared according to these protocols are useful m quantitative 
immunoassays that determine concentrations of antigen-bearing substances in biological sanq)les; 

30 they also can be used semi-quantitatively or qualitatively to identify the presence of antigen in a 
biological sanq)le; or for immunolocalization of the corresponding ovarian cancer-related protein. 

For administration to human patients, antibodies, e.g., ovarian cancer-related protein specific 
monoclonal antibodies (such as antibodies to the proteins encoded by the encoding sequences listed 
to in Table 1), can be humanized by methods known in the art. Antibodies with a desired binding 

35 specificity can be commercially humanized (Scotgene, Scotland, UK; Oxford Molecular, Palo Alto, 
CA). Alternatively, human antibodies can be produced. Methods for producing human antibodies 
are known in flie art; see, for instance, Canevari et al, hit J. Biol Markers 8:147-150, 1993 and 
Green, J. Immunol Metk. 231:1 1-23, 1999, for instance. 
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EXAMPLE 9 
Protein-Based Analysis 

This exaniple describes how to use the ovarian cancer-related molecules disclosed herein to 
5 quantitate the level ofoneormore ovarian cancer-related proteins in a subject. 

An alternative meHiod of diagnosing, staging, detecting, or predicting ovarian cancer is to 
quantitate the level of one or more ovarian cancer-related proteins in a subject, for instance in tiie 
cells of the subject. This diagnostic tool is useful for detecting reduced or increased levels of ovarian 

1 0 cancer-related proteins. Localization and/or coordinated egression (terq)orally or spatially) of 
ovarian cancer-related proteins can also be examined using well known techniques. The 
determination of reduced or increased ovarian cancer-related protein levels, in comparison to such 
expression in a normal subject (e.g., a subject not having ovarian cancer or not having a 
predisposition developing this condition, disease or disorder, would be an alternative or supplemental 

1 5 approach to the direct determinaticn of ovarian cancer-related nucleic acid levels by the methods 
outlined above and equivalents. The availability of antibodies specific to specific ovarian cancer- 
related protein(s) will facilitate the detection and quantitation of cellular ovarian cancer-related 
protein(s) by one of a number of immunoassay methods which are well known in the art and axe 
presented in Harlow and Lane (Antibodies, A Laboratory Manual, CSHL, New York, 1988). 

20 Methods of constructing such antibodies are discussed above, in Exan^le 7. 

Any standard immunoassay format ELISA, Western blot, or RIA assay) can be used to 
measure altered expression of ovarian cancer-related polypeptide or protein levels; con^arison is to 
wild-type (normal) ovarian cancer-related protein levels, and a difference in ovarian cancer-related 
polypeptide levels is indicative of a biological condition resulting from altered expression of ovarian 

25 cancer-related polypeptides or proteins, such as neoplasia. Whether the key dilfefence is an iocrease ' 
or a decrease is dependent on the specific ovarian cancer-related protein under examination, as 
discussed herein. Immunohistochemical techniques may also be utilized for ovarian cancer-related 
polypeptide or protein detection and quantificatioiL For exan^le, a tissue sample may be obtained 
from a subject, and a section stained for the presence of an ovarian cancer-related proteia using the 

30 appropriate ovarian cancer-related protein specific binding agent and any standard detection system 
one which includes a secondary antibody conjugated to horseradish peroxidase). General 
guidance regarding such techniques can be found in, e.g., Bancroft and Stevens (Theory and Practice 
of Histological Techniques, Churchill Livingstone, 1982) and Ausubel et al (Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, 1998). 

35 For the purposes of quantitating an ovarian cancer-related protein, a biological sample of the 

subject, which sample includes cellular proteins, is required. Such a biological sample may be 
obtained firombody cells, such as those present in peripheral blood, urine, saliva, tissue biopsy, 
amniocentesis sanq>les, surgical specimens and autopsy material, particularly breast cells. 
Quantitation of an ovarian cancer-related protein can be achieved by immunoassay and the amount 
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conq)ared to levels of the protein found in healthy cells. A significant difference (either increase or 
decrease) in the amount of ovarian cancer-related protein in the cells of a subject conq>ared to the 
amount of the same ovarian cancer-related protein found in noimal human cells is usually about a 
10% or greater change, for instance 20%, 30%, 40%, 50% or greater difference. Substantial under- 

5 or over-expression of one or nGtore ovaiian cancer-related protein(s), may be indicative of neoplasia or 
a predilection to neoplasia or metastasis, and especially ovarian epithelial cancer. 

The protein-based diagnostic methods as described herein axe predictive of ovarian cancer. 
Cells of any tumors that densonstrate altered expression levels (e.g., throu^ genomic anq)lification, 
deletion, mutation, or other over- or under-expression) of nucleotide sequences that share homology 

10 with the ovarian cancer-related nucleic acids disclosed herein are aggressive tumor cells, and result in 
decreased survival, increased metastasis, increased rates of clinical recunence, and overall worsened 
prognosis. 

EXAMPLE 10 

15 Gene Therapy 

This ex^mplQ describes how to use the ovarian cancer-related molecules and analysis 
methods disclosed herein to effect gene therapy for the treatment of ovarian cancer. 

20 Gene therapy approaches for combating neoplasia (particularly ovarian cancer) in' subjects 

are made possible by the present disclosure. 

Retroviruses have been considered a preferred vector for experiments in gene therapy, with a 
high efficiency of infection and stable integration and expression (see Qrkin et cd.. Prog, Med. Genet. 
7:130-142, 1988). A full-length ovarian cancer-related gene or cDNA can be cloned into a retroviral 

25 vector and driven from either its endogenous promoter or from the retroviral LTR (long terminal 
repeat). Other viral transfection systems may also be utilized for this type of approach, mcluding 
adenovirus, adeno-associated virus (AAV) (see McLaughlin et ah, J. ViroL 62:1963-1973, 1988), 
Vaccinia virus (Moss et al, Annu, Rev. Immunol 5:305-324, 1987), Bovine Papilloma virus 
(Rasmussen et al, Methods Enzymol 139:642-654, 1987) or members of the herpesvirus group such 

30 as Epstein-Bair virus (Margolskee et al, Mol Cell Biol 8:2837-2847, 1988). 

Developments in gene therapy techniques include the use of RNA-DNA hybrid 
oligonucleotides, as described by Cole-Strauss et al {Science 273:1386-1389, 1996). This technique 
may allow for site-specific integration of cloned sequences, thereby permitting accurately targeted 
gene replacement. 

35 In addition to delivery of ovarian cancer-related protein encoding sequences to cells using 

vhal vectors, it is possible to use non-infectious methods of delivery. For instance, Upidic and 
Kposome-mediated gene delivery has recently been used successfully for transfection with various 
genes (for reviews, see Tenqpleton and Lasic, Mol Biotechnol 1 1:175-180, 1999; Lee and Huang, 
Crit Rev. Ther. Drug Carrier Syst. 14:173-206; and Cooper, Semin. Oncol 23:172-187, 1996). For 
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instance, cationic liposomes have been analyzed for their ability to transfect monocytic leukemia 
cells, and shown to be a viable altemative to using viral vectors (de Lima et aL, Mol Membr Biol. 
16:103-109, 1999). Such cationic Uposomes can also be targeted to specific cells through the 
inclusion of, for instance, monoclonal antibodies or other appropriate targeting ligands (see Kao et 
5 al. Cancer Gene IJier, 3:250-256, 1996). 

To reduce the level of ovarian cancer-ielated gene e^qpression, gene therapy can be carried 
out using antisense or otiier suppressive constructs, fhe constmction of which is discussed above 
(Exanq>le 4). 

10 EXAMPLE 11 

Kits 

This exaix^le describes various kits for using fhe ovarian cancer-related molecules and 
analysis methods disclosed herein. 

15 

Kits are provided to determine the level (or relative level) of e:q)ression of one or more 
species of ovarian cancer-related nucleic acids (e^g., mRNA) or one or more ovarian cancer-related 
protein (z.e., kits containing nucleic acid probes or antibodies or other ovarian cancer-related protein 
specific binding agents). Kits are also provided that contain the necessary reagents for determining 
20 gene copy nmnber (genomic amplification or deletion), such as probes or primers specific for an 
ovarian cancer-related nucleic acid sequence. These kits can each include instroctions, for instance 
instructioiis that provide calibration curves or charts to conq>are with the determined (e.g., 
e:q>erimentally measured) values. 

25 ' A. Kits for Detection of Ovarian Cancer-related Genomic 

Amplification or Deletion 

The nucleotide sequence of ovarian cancer-related nucleic acid molecules disclosed herein, 
and fragments thereof, can be supplied in the form of a kit for use in detection of ovarian cancer- 
related genomic anqjlification/deletion and/or diagnosis of progression to or predilection to progress 

30 to ovarian epithelial cancer. In such a kit, an appropriate amoimt of one or more oligonucleotide 
primer specific for an ovarian cancer-related-sequence is provided in one or more containers. The 
oligonucleotide primers may be provided suspended in an aqueous solution or as a freeze-dried or 
lyophilized powder, for instance. The container(s) in which the oUgonucleotide(s) are supplied can 
be any conventional container that is capable of holding tibe supplied form, for instance, microfiige 

35 tubes, ampoules, or bottles. In some applications, pairs of primers may be provided in pre-measured 
single use amounts in individual, typicaUy disposable, tubes, or equivalent containers. With such an 
arrangement, the san^le to be tested for the presence of ovarian cancer-related genomic 
amplification/deletion can be added to the individual tubes and in vitro anotplification carried out 
directly. 
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The amount of each oUgonucleotide primer suppUed in the kit can be any amount, 
depending for instance on the market to which tiie product is directed. For instance, if the kit is 
adapted for research or clinical use, the amount of each oligonucleotide primer provided likely would 
be an amoimt sufficient to prime several in vitro an^lification reactions. Those of ordinary skill in 
5 the art knovr flie amount of oligonucleotide primer that is appropriate for use in a single aicplification 
reaction. General guidehnes may for instance be found in Innis et al (PCR Protocols, A Guide to 
Methods and Applications, Academic Press, Inc., San Diego, CA, 1990), Sambrook et al. Qn 
Molecular Cloning: A Laboratory Manual, CMd Spring Harbor, New York, 1989), and Ausubel et 
al Qn Current Protocols in Molecular Biology^ John Wiley & Sons, New York, 1998). 

10 A kit may include more than two piimeis, in order to facilitate the in vitro amplification of 

ovarian cancer-related genomic sequences (or a protein of such a sequence), for instance an ovarian 
cancer-related nucleic acid listed in Table 1, or tiie 5' or 3* flanking region thereof. 

In some embodiments, kits may also include the reagents necessary to carry out in vitro 
anopUfication reactions, including, for instance, DNA sample preparation reagents, appropriate 

15 buffers (e.g., polymerase buffer), salts (e.g., magnesium chloride), and deoxyribonucleotides 
(dNTPs). Written instructions may also be included. 

Kits may in addition include either labeled or imlabeled oHgonucleotide probes for use in 
detection of the in vitro amplified sequences. The appropriate sequences for such a probe will be any 
sequence that falls between the amealing sites of two provided oligonucleotide primers, such that the 

20 sequence the probe is coniplementary to is an5)Ufied during the in vitro amplification reaction (if it is 
present in the sample). 

It may also be advantageous to provided in the kit one or more control sequences for use in 
the in vitro an^plification reactions. The design of appropriate positive control sequences is well 
known to one of ordinary skill in the appropriate art 

25 ■ .... 

B. Kits for Detection of mRNA Expression 

Kits similar to those disclosed above for the detection of ovarian cancer-related genonnc 
amplification/deletion can be used to detect ovarian cancer-related mKNA expression levels 
(including over- or under-expression, in conparison to the expression level in a control sample). 
30 Such kits include an appropriate amount of one or more of the oUgonucleotide primers for use in, for 
instance, reverse transcription PCR reactions, similarly to those provided above, with art-obvious 
modifications for use with RNA. 

In some embodiments, kits for detection of ovarian cancer-related mRNA e^q^ression may 
also include reagents necessary to carry out RT-PCR or other in vitro anqjlification reactions, 
35 including, for instance, RNA saiiq>le preparation reagents (including e.g., an RNAse inhibitor), 
appropriate buffers {e,g,, polymerase buiffer), salts (e.g., magnesium chloride), and 
deoxyribonucleotides (dNTPs). Written instructions may also be included. 

Kits may in addition include either labeled or unlabeled oligonucleotide probes for use in 
detection of an in vitro anq)lified target sequence. The appropriate sequences for such a probe will be 
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any sequence that falls between the annealing sites of the two provided oligonucleotide primers, such 
that the sequence the probe is complementary to is anq)lified during the PGR reactioa 

It may also be advantageous to provided in the kit one or more control sequences for use in 
the in vitro an5>lification reactions. The design of appropriate positive control sequences is well 
5 known to one of ordinary skill in the appropriate art. 

Alternatively, kits may be provided with the necessary reagents to carry out quantitative or 
semi-quantitative Northem analysis of ovarian cancer-related mRNA. Such kits include, for instance, 
at least one ovarian cancer-related sequence-specific oligonucleotide for use as a probe. This 
oligonucleotide may be labeled in any conventional way, including with a selected radioactive 
10 isotope, enzyme substrate, co-&ctor, ligand, chemiluminescent or fluorescent agent, hapten, or 
en2:yme. 

C. Kits For Detection of Ovarian Cancer-linked Protein or 
Peptide Expression 

15 Klits for the detection of ovarian cancer-linked protein expression, for instance altered (over 

or under) expression of a protein encoded for by a nucleic acid molecule listed in Table 1 or 
elsewhere, are also enconqDassed herein. Such kits may include for example at least one target 
(ovarian cancer-Unked) protein all or a portion of the small cell limg carcinoma cluster 4 antigen 
(CD24) (SEQ ID NO: 181-182), secretory leukocyte protease inhibitor antileukoproteinase (SLPI) 

20 (SEQ ID NO: 340-341), secreted phosphoprotein 1 (SPPl) (SEQ ID NO: 342), B-factor, properdin 
(BF) (SEQ ID NO: 343-344), *liomolog of Cksl«p34Cdc28/Cdc2-associated protein" (CKSI) (SEQ 
ID NO: 345-347), matrix metalloproteinase 7 (MMP7) (SEQ ID NO: 348-349), paired box gene 8 
(PAX8) (SEQ ID NO: 350-351), serine protease inhibitor, Kunitz type, 2 (SPINT2) (SEQ ID NO: 
352-353), ZWIO interactor {ZWINT) (SEQ ID NO: 354), diacylglycerol kinase (DGKH) (SEQ ID 

25 NO: 355), high-mobility group (nonhistone chromosomal) protein isbforms I and Y (HMGIY) (SEQ 
ID NO: 356), Syndecan-4 - an5)higlycan - ryudocan core protein (SDC4) (SEQ ID NO: 357-359), 
cyclin-dependent kinase inhibitor 2A (CDKN2A) (SEQ ID NO: 360), sodium channel, nonvoltage- 
gated 1 alpha (SCNNIA) (SEQ ID NO: 361-362), lactate dehydrogenase A (LDHA) (SEQ ID NO: 
363), adult folate receptor (FOLRI) (SEQ ED NO: 364-365), Triosephosphate isomerase 1 (ZP/i) 

30 (SEQ ID NO: 366-367), kallikrein 8 (neuropsin/ovasin) (KLK8) (SEQ ID NO: 368), CXC chemokine 
receptor 4- fusin-neuropeptide Y receptor-L3 (CXCR4) (SEQ ID NO: 200), kinesin-like 1 {KNSLJ) 
(SEQ ID NO: 369-370), H2A histone family, member O (H2AF0) (SEQ ED NO: 371-372), major 
histocompatibility complex, class H, DR alpha, HLA-DRA, cysteine-rich protein 1 (intestinal) 
(CRIPI) (SEQ ID NO: 375), pyrophosphatase (inorganic), (PP) (SEQ ID NO: 376), EST (SEQ ID 

35 NO: 377-378) 666391, glucose transporter (HepG2), (SLC2A1) (SEQ ID NO: 379-381), EST (SEQ 
ID NO: 377-378) 897770, hepatoma-derived growth fector (HDGF) (SEQ ID NO: 383-385), 
argininosuccinate synlhetase {ASS) (SEQ ID NO: 386), claudin 4 (CLDN4) (SEQ ID NO: 387-388), 
preferentially e3q)ressed antigen in melanoma {PRAME) (SEQ ID NO: 389), LAR = LCA-homologue 
(PTPRF) (SEQ ID NO: 390-391), eyes absent (Drosophila) homolog 2 (EY42) (SEQ ID NO: 392- 
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393), L-myc (MYCLJ) (SEQ ID NO: 394-396), STAT1=IFN alpha^eta-responsive transcription 
factor ISGF3 beta subunits (p91/p84) (STATJ) (SEQ ID NO: 397-399), mitochondrial carrier 
homolog 2 (MTCH2) (SEQ ID NO: 400-401), 5-hydroxytryptanune (serotonin) receptor 3A {HTRSA) 
(SEQ ID NO: 402), cyclin El (CCNEl) (SEQ ID NO: 403-404), cadherin 6, type 2, K-cadherin (fetal 
5 kidney) (CDH6) (SEQ ID NO: 405), 5*-AMP-activated protein kinase - gamma-l subunit (PRKAGl) 
(SEQ ID NO: 406-408), defensin beta 1 (DEFBI) (SEQ ID NO: 409), actin related protein 2/3 
con5)lex, subunit lA (41 kD) {ARPCJB) (SEQ ID NO: 410-41 1), PKC iota=Protem kinase C, iota 
{PRKCI) (SEQ ID NO: 412-414), glyceraldehyde-3-pliosphate dehydrogenase {GAPD) (SEQ ID NO: 
415), con5>lement component 2 (C2) (SEQ ID NO: 416-417), H2A histone family, member Y 

10 {H2AFY) (SEQ ID NO: 418-419), transmembrane 4 superfemily member 1 (TM4SF1) (SEQ ID NO: 
420-421), glyceraldehyde-3-phosphate dehydrogenase (GAPD) (SEQ ID NO: 422-423), Interferon- 
inducible protein 1-8U {IFITM3) (SEQ ID NO: 424-426), glycine dehydrogenase (decarboxylatmg; 
glycine decarboxylase, glycine cleavage system protein P) (GLDC) (SEQ ID NO: 427-428), 
calumenin (CALU) (SEQ ID NO: 429-430), hemoglobin alpha 2 (HBA2) (SEQ ID NO: 431-432), 

15 SlOO calcium-binding protein Al 1 (calgizzarin) (SIOOAIJ) (SEQ ID NO: 433), Lactate 

dehydrogenase A (LDHA) (SEQ ID NO: 434-436), ubiquitin-conjugating enzyme E2C (UBE2C) 
(SEQ ID NO: 437), E2F-3=pRB-binding transcription factor=KIAA0075 (E2F3) (SEQ ID NO: 438- 
440), E-cadherin (CDHI) (SEQ ID NO: 441-442), proteasome (prosome, macropain) activator 
subunit 2 (PA28 beta) (PSME2) (SEQ ID NO: 443-444), OP-l=osteogenic protein in the TGF-beta 

20 family (BMP7) (SEQ ID NO: 445-447), or topoisomerase H iT0P2A) (SEQ ID NO: 448) specific 

binding agent (e.g., a polyclonal or monoclonal antibody or antibody fragment), and may include at 
least one control. The ovarian cancer-linked protein specific binding agent and control noay be 
contained in separate containers. The kits may also include a means for detecting ovarian cancer- 
related protein:agent conq)lexes, for instance the agent may be detectably labeled. If Ifae detectable 

25 agent is not labeled, it may be detected by second antibodies or protein A, for exan^le, either of both 
of which also may be provided in some kits in one or more separate containers. Such techniques are 
well knowa 

Additional components in some kits include instructions for carrying out the assay. 
Instructions will allow the tester to determine whether ovarian cancer-linked expression levels are 
30 elevated or reduced in comparison to a control sample. Reaction vessels and auxiliary reagents such as 
chromogens, buffers, enzymes, etc. also may be included in the kits. 
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EXAMPLE 12 
Identification of Therapeutic Compounds 

This exaniple describes how to use the ovarian cancer-related molecules disclosed herein to 
5 identify con:q)ounds for potential therapeutic use in treating, reducing, or preventing ovarian cancer or 
development or progression of ovarian cancer. 



The ovarian cancer-related molecules disclosed herdn, and more particularly the Imkage of 
&ese molecules to cancer, can be used to identify conopounds that are useful in treatmg, reducing, or 
10 preventing ovarian cancer or development or progression of ovarian cancer. These molecules can be 
used alone or in combination, for instance in sets of two or more that are linked to cancer or cancer 
progression. 

By way of example, a test compound is applied to a cell, for instance a test cell, and at least 
one ovarian cancer-related molecule level and/or activity in the cell is measured and compared to the 

IS equivalent measurement &om a test cell (or from the same cell prior to application of Ifae test 
compound). If application of the conapound alters the level and/or activity of an ovarian cancer- 
related molecule (for instance by increasing or decreasing that level), then that compound is selected 
as a likely candidate for further characterization. In particular examples, a test agent that opposes or 
inhibits an ovarian cancer-related change is selected for further study, for example by exposing the 

20 agent to an ovarian epithehal cancer cell in vitro, to determine whether in vitro growth is inhibited. 
Such identified confounds may be useful in treating, reducing, or preventing ovarian cancer or 
development or progression of ovarian cancer. In particular embodiments, the con:qx)und isolated 
will inhibit or inactivate an ovarian cancer-related molecule, for instance tibiose represented by the 
nucleic acids listed in Table I . 

25 Methods for identifying such compounds optionally can include the g^eration of ah ovarian 

cancer-related gene egression profile, as described herein. Control gene e3q}ression profiles useful 
for conoparison in such noediods may be constructed firom normal ovarian tissue, including primary 
ovarian cancer tissue. 



30 EXAMPLE 12 

Gene Expression Profiles (Fingerprints) 

This exaiEple describes how to use the ovarian cancer-related nucleic acids and analysis 
methods disclosed herein to generate and use gene expression profiles, or "fingerprints." 

35 

With the provision herein of methods for determining molecules that are linked to ovarian 
cancer, and the provision of a latge collection of such ovarian cancer-linked molecules (as 
represented for mstance by diose listed in Table 1), gene e:q>ression profiles that provide information 
on the ovarian cancer-state of a subject are now enabled. 
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Ovarian cancer-related expression profiles conqjrise the distinct and identifiable pattern of 
e;q}ression (or level) of sets of ovarian cancer-related genes, for instance a pattern of high and low 
expression of a defined set of genes, or molecules that can be correlated to such genes, such as 
mRNA levels or protein levels or activities. Useful sets of molecules for constracting nucleic acid 
5 e3q>ression profiles include at least one fliat is represented by the following genes and ESTs: 

BCKDHB (SEQ ID NO: 16-17), ZNF33A (SEQ ID NO: 20-22), EST 192198 (SEQ ID NO: 25), EST 
128738 (SEQ ID NO: 26-27), EST 42921 1 (28-29), FU22174 (SEQ ID NO: 30-31), EST 415562 
(SEQ ID NO: 32-33), EST 296488 (SEQ ID NO: 34-35), EST 120124 (SEQ ID NO: 36-37), EST 
132142 (SEQ ID NO: 38-39), EST 50635 (SEQ ID NO: 40), FOR (SEQ ID NO: 41-43), EST 73702 
10 (SEQ ID NO: 46-47), EST 2218314 (SEQ ID NO: 48), EST 2261113 (SEQ ID NO: 49), IFITMl 

(SEQ ID NO: 50-54), IFITMl (SEQ ID NO: 55-59), KIAA0203 (SEQ ID NO: 61-62), G1F3 (SEQ ID 
NO: 68-69), BST2 (SEQ ID NO: 70-72), EST 1384797 (SEQ ID NO: 196) , TLR3 (SEQ ID NO: 199- 
201), SPONI (SEQ ID NO: 160-161), HSRNASEB (SEQ ID NO: 162-163), EST 294506 (SEQ ID 
NO: 146-148), SORLl (SEQ ID NO: 149-151), SUTl (SEQ ED NO: 73), PLl (SEQ ID NO: 77), EST 
15 108422 (SEQ ID NO: 78-79), CEBPG (SEQ ID NO: 80), HLA^DPA (SEQ ID NO: 97-99), H2AFL 
(SEQ ID NO: 107-109), IGKC (SEQ ID NO: 1 12-1 16), SCYBIO (SEQ ID NO: 120-121), RGSl (SEQ 
ID NO: 122-126), LSR68 (SEQ ID NO: 168), SGK (SEQ ID NO: 176-178), and ZFP36 (SEQ ID NO: 
167-173). These genes and ESTs, which have not previously been correlated with cancer, present 
potentially useful novel markers for cancer, and in particular, ovarian cancer. 
2^ A second example set of molecules that could be used in a profile would include at least one 

fliat is represented by (or correlated to) flie genes and ESTs represented by the SEQ ID NOs in Table 
9. These nucleic acids, which are disclosed herein to be differentially expressed in ovarian cancer 
(see Figure 2), are suitable for markers to diagnose, prognose, and monitor ovarian cancer in a 
subject. In addition, these genes and ESTs are potentially useful as markers for classifying tumors 
25 into types, for instance into j8i?G4i.type or MG42-type tumors, using the methods discloseffherein. 

A third exanq)le set of molecules fliat coy ld be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs: 417, 284, 
285, 281, 283, 278, 273, 282, 274. These represent markers disclosed herein that were found to be 
differentially expressed between BRCAl-Urik&d and sporadic tumors in a comparison to reference 
30 Immortalized Ovarian Epithelial Cells (lOSE). These markers are useful for classifying tumors into 
^iJG4i-linked and sporadic types, and present potential targets for treatment of ovarian cancer. 

A fourth exanq)le set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs: 279-280, 
which, as disclosed herein, are markers that were found to be diflSsrentially expressed between 
35 BRCA2'Ur^Qd and sporadic tumors in a conyarison to reference Immortalized Ovarian Epithelial 
Cells. These markers are usefiil for classifying tumors into iff/?C42-linked and sporadic types, and 
present potential targets for treatment of ovarian cancer. 

A fifth exan^le set of molecules ±at could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs: 281, 282 ' 
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and 274, which, as disclosed herein, are markers that were found to be differentially expressed 
between combined BRCA-Lioked and sporadic tumors in a comparison to reference Immortalized 
Ovarian Epithelial Cells. These markers are useful for classifying tumors into BRCA-linked and 
sporadic types, and present potential targets for treatment of ovarian cancer. 
5 A sixth QXdmple set of molecules that could be used in a profile wo.uld include at least one 

that is represented by (or correlated to) the genes and ESTs represented by the SEQ ID NOs set forth 
in Table 10, which, as disclosed herein, are markers that can be used to segregate ^i^C^i-linked from 
BRCA2-hxjked tumor types using coziq)ound covaiiate prediction analysis. These markers are useful 
for classifying tumors into one of two types of tumors, which provides information helpful to a 
10 clinician in choosing a course of treatment for the patient based on the type of tumor into which the 
smsplQ is classified. 

A seventh exanq>le set of molecules that could be used in a profQe would include at least 
one that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NO: 16-201, 
565-567, and 803-804. These genes and ESTs were found, as disclosed herein, to be differentially 

15 ejqjressed in a comparison of BRCAI-]mkGd and BRCA2']3hksd to sporadic tumors. Hence, fliese 
genes and ESTs present potentially useful markers for classifying tumors into types, using the 
methods disclosed herein. Furthermore, they represent potential targets for pharmaceutical treatment 
of tumors of each respective tumor type. 

A eighth example set of molecules that could be used in a profile would include at least one 

20 fliat is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs: 124-126, 
3 19, 429-430, 504-523, 533-535, 544, and 548-799. As disclosed herein, ftese nucleic acids were 
found to be overe7q}ressed in a comparison of £jRG47-linked, ^iiC12-linked and sporadic tumor 
samples. Hence, these genes and ESTs present potentially usefiil markers for classifying tumors into 
types, using the methods disclosed herein. Furthermore, they represent potential targets for 

25 pharmaceuticaltreatmentof tumors' of each respective tumor type. 

A ninth example set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs: 202-339. 
As disclosed herein, these nucleic acids were fomd to be overexpressed in ovarian cancer in a 
comparison of ovarian epithelial cancer to normal postmenopausal ovarian tissue. Hence, these genes 

30 and ESTs present potentially useful markers diagnosis, prognosis, and monitoring of ovarian cancer. 
In addition, they represent potential targets for pharmaceutical treatment of ovarian tumors. 

A tenth example set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs:97 and 340- 
448. As disclosed herein, fliese nucleic acids were found to be underexpressed in ovarian cancer in a 

35 comparison of ovarian epithelial cancer to normal postmenopausal ovarian tissue. Hence, these genes 
and ESTs present potentially useful markers diagnosis, prognosis, and monitoring of ovarian cancer. 
In addition, they represent potential targets for pharmaceutical treatment of ovarian tumors. 

In other exanq}les of ovarian cancer-related gene expression profiles, such profiles may be 
further broken down by the manner of molecules included in the profile. Thus, certain exanqples of 
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profiles coay include a specific class of ovarian cancer markers, such as those molecules involved in 
cell cycle control. 

Particular profiles may be specific for a particular stage of normal tissue (e.g., ovarian 
tissue) growth or disease progression (e.g., progression of ovarian cancer). Thus, gene expression 
5 profiles can be established for a pre-ovarian cancer tissue (lcl, normal ovarian tissue), and a primary 
ovarian cancer tissue. Each of these profiles includes information on fiie expression level of at least 
one, but usually two or more, genes that are linked to ovarian cancer (eg., ovarian cancer-related 
genes). Such information can include relative as well as absolute egression levels of specific genes. 
Likewise, the value noeasured may be the relative or absolute level of protein expression, which can 
1 0 be correlated with a ''gene e3q)ression level." Results 6om the gene expression profiles of an 

individual subject are often viewed in the context of a test sanq>le conq>ared to a baseline or control 
saxiq>le fingerprint. 

The levels of molecules that make up a gene e3q>ression profile can be measured in any of 
various known ways, which may be specific for the type of molecule being measured. Thus, nucleic 

15 acid levels (such as direct gene expression levels, such as the level of mRNA expression) can be 

measured using specific nucleic acid hybridization reactions. Protein levels may be measmred using 
standard protein assays, using inomunologic-based assays (such as ELISAs and related techniques), or 
using activity assays, for instance. Examples for measuring nucleic acid and protein levels are 
provided herein; other methods are well known to those of ordinary skill in the art 

20 Examples of ovarian cancer-related gene e?q)ression profiles can be in array format, such as 

a nucleotide (^.g., polynucleotide) or protein array or microairay. The use of arrays to determine the 
presence and/or level of a collection of biological macromolecules is now well known (see, for 
example, methods described in published PCT application nunaber W09948916, describing hypoxia- 
related gene e^qxression armys). In array-based measurement methods, an array may be contacted 

25 with polynucleotides (in the case of a nucleic acid-based array) or polypeptides (in the case of a 
protein-based array) from a ssujople from a subject The amount and/or position of binding of the 
subject's polynucleotides or polypeptides then can be determined, for instance to produce a gene 
expression profile for that subject. Such gene expression profile can be conopared to another gene 
expression profile, for instance a control gene expression profile from a subject having a known 

30 gynecological or ovary-related condition. Optionally, the subject's gene expression profile can be 
correlated with one or more appropriate treatments, which may be correlated with a control (or set of 
control) expression profiles for stages of ovarian cancer, for instance. 

This disclosure provides the identification of ovarian cancer-related molecules that exhibit 
35 alterations in expression during development of ovarian cancer, and e?qjression fingerprints (profiles) 
specific for ovarian cancers. It further provides methods of using these identified nucleic acid 
molecules, and proteins encoded thereby, and expression fingerprints or profiles, for instance to 
predict and/or diagnose ovarian cancer, and to elect treatments for instance based on likely response. 
These identified ovarian cancer-related molecules also can serve as therapeutic targets, and can be 
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used in methods for identifying, developing and testing therapeutic conq)ounds. It will be apparent 
that the precise details of the methods described may be varied or modified without departing fi-om 
the spirit of the described invention. We claim all such modifications and variations that fall within 
the scope and spirit of the claims below. 
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CLAIMS 

We claim: 

5 1 . A method of classifying an ovarian tumor as a BRCA-l-like or BRCA-2-like or 

non-BRCA-tOce tumor, comprising: 

determining a pattern of e3q>ression in the ovarian tumor of a plurality of markers listed in 
Table 1 , wherein the pattern of expression in the ovarian tumor is determined relative to a standard 
ovarian tissue; 

10 con^aring a similarity of the pattem of e7q)ression of the pluraUty of markers in the ovarian 

tumor to a pattem of egression of the pluraUty of markers in a conq)arison tissue of a known BRCA- 
1-like or BRCA-2-like or non-BRCA-like tumor, wherein the pattem of e^qiression in the conq)arison 
tissue is determined relative to the standard ovarian tissue; 

wherein a similarity of the pattem of expression in the ovarian tumor to a pattem of 

15 expression of the comparison tissue of the known BRCA-l-like tumor classifies the ovarian tumor as 
a BRCA- 1-like tumor, a similarity of the pattem of e5q>ression in the ovarian tumor to a pattem of 
expression of the known BRCA-2-like tumor classifies the ovarian tumor as a BRCA-2-like tumor, 
and a similarity of the pattem of expression in the ovarian tumor to a pattem of expression of the 
known non-BRCA-like tumor classifies the ovarian tumor as a non-BRCA-like tumor. 

20 

2. The method of claim 1, wherein the mefliod conoprises determining a pattem of 
over-expression or under-e?g)ression of the plurality of markers in the ovarian tumor to over- 
e3q)ression or under-expression of the plurality of markers of &e conq>arison tissue. 

* 25 3. ' ' The method of claun 2, wherein the method comprises detemodning a pattem of 

both over-e?9ression and nnder-expression of the plurality of markers in the ovarian tumor to over- 
expression or under-expression of the plurality of markers in the comparison tissue. 

4. The method of claim 1 wherein the coniparison tissue is from a known BRCA-1- 
30 like tumor, and the method comprises determining whether the ovarian tumor is a BRCA-1 -like 

tumor by con:q)aring the pattem of expression in the ovarian tumor to the pattem of e;q)ression in the 
comparison tissue. 

5. The method of claim 1 wherein the coniparison tissue is from a subject known to 
35 have a mutation in BRCA-1 and the method corqprises determining whether the ovarian tumor is a 

BRCA-l-like tumor by con5)aring the pattem of ejq)ression in the ovarian tumor to the pattem of 
egression in the con::{)arison tissue. 
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6. The metiiod of claim 1 wherein the coiiQ)arison tissue is jfrom a subject known to 
have a mutation in BRCA-2 and the method con:^rises determining whether the ovarian tumor is a 
BRCA-2-like tumor by comparing the pattern of egression in the ovarian tumor to the pattern of 
expression in the comparison tissue. 

5 

7. The method of claim 1 wherein the comparison tissue is 6om a known BRCA'-2- 
like tumor, and the method comprises detenmning whether the ovarian tumor is BRCA-2-like by 
con9)aring the pattern of e3q)ression in the ovarian tumor to the pattern of expression in the 
comparison tissue. 

10 

8. The method of claim 7, wherein classifying the ovarian tumor conq)rises 
detemnning whether a tumor ttiat does not contain a BRCA-1 or BRCA-2 mutation is BRCA-l-like 
orBRCA-2-like. 

15 9. The method of claim 1 wherein the comparison tissue is from a known non-BRCA- 

like tumor, and the method conqjrises determining whether the ovarian tumor is non-BRCA-like by 
comparing the pattern of expression in the ovarian tumor to the pattern of expression in the 
comparison tissue. 

20 10. The method of claim 1, wherein the standard ovarian tissue is tissue from an 

immortalized ovarian cell, ovarian tissue from a subject not having ovarian cancer, a subject not 
predisposed to developing ovarian cancer, or ovarian tissue from a subject from whom the ovarian 
tumor was obtained at an earlier point in time. 

25 ' ' 11 . The method of claim 1, wherein the patterns of expression ue patterns of 

logarithmic expression ratios. 

12. The method of claim 1, wherein the patterns of esqpression are multidinnensional 
scaling pattems. 

30 

13. The method of claim 12 wherein the multi-dimensional scaling pattems are visually 
compared to determine similarities. 

14. The method of claim 1, wherein the pattems of expression are hierarchical 
35 clustering pattems. 

15. The method of claim 14, wherein standard normal deviation values of the 
logarithuiic expression ratios are assigned relative color intensities that assist in the visual 
conq[)arison. 
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16. The method of claim 15, wherein the hierarchical clustering patters are visually 
conq)ared to determine similarities. 

5 17. The me&od of claim 1 1 comprising comparing the logarithmic expression ratios of 

the plurality markers using compound covariate predictor analysis. 

18. The mediod of claim 1 1, wherein the method con:9>rises differentiating a BRCAl- 
like ovarian tumor from a sporadic ovarian tumor by con^aring relative logarithmic expression ratios 

10 of at least one marker shown in Table 6. 

19. The method of claim 18, wherein differentiating a 5/?C47-linked ovarian tumor 
from a sporadic ovarian tumor comprises comparing the relative logarithmic expression ratios of 
CD72 (SEQ ID NO: 805), SLC25A11 (SEQ ID NO: 544), LCN2 (SEQ ID NO: 545-547), PSTPIPl 

15 (SEQ ID NO: 538-540), SIAHBPl (SEQ ID NO: 543), UBEl (SEQ ID NO: 533), WAS (SEQ ID NO: 
524-526), IDH2 (SEQ ID NO: 541-542), or PCTKJ (SEQ ID NO: 527-528) in the ovarian tumor and 
comparison tissue. 

20. The method of claim 1 1, wherein the method comprises difTerentiating a BRCAZ- 
20 like ovarian tumor from a non-BRCA-like ovarian tumor by compaiing relative logarithmic 

expression ratios of at least one marker shown in Table 7. 

2 1 . The metihod of claim 20, wherein the method conoprises conq)aring the relative 
logarithmic expression ratios of L0C51 760 (SEQ ID NO: 279) or LRPAPl (SEQ ID NO: 280) to 

25 differentiate a BRCA2Ajk^ ovarian tumor from a non-BRCA like ovarian tuinor " 

22. The method of claim 2 1 , wherein the method con5)rises differentiating a non- 
BRCA-like tumor from a BRCA-l-like or BRCA-2-like ovarian tumor by con^aring relative 
logarithmic expression ratios of at least one marker shown in Table 8. 

30 

23. The method of claim 22, wherein the method comprises comparing relative 
logarithmic ejqpression ratios of PSTPIPl (SEQ ID NO: 281), IDH2 (SEQ ID NO: 282), or PCTKl 
(SEQ ID NO: 274) to differentiate a combined BRCAI- and ARG42-linked ovarian tumor from a 
sporadic ovarian tumor. 

35 

24. The method of claim 1 1, wherein the method comprises differentiating a BRCAl- 
like ovarian tumor from a BRCA2'']ikt ovarian tumor by conq>aring relative logaritiunic expression 
ratios of at least one marker shown in Table 10. 
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25. The method of claim 1, wherein the mefbod further co]iq)rises selecting a treatment 
strategy based on classifying the ovarian tumor as BRCAl-Wse^ BRCA2- like or non-BRCA-WsR, 

26. The method of claim 25, wherein the treatment strategy comprises selectiDg a more 
5 aggressive treatment regimen for a BRCAIASsg or BRCA2''Vlk& tumor. 

27. The method of claim 26, wherein the treatment is chemothmpy, radiotherapy, or 
surgical removal of the affected tissue and/ot surrounding area. 

10 28. The method of claim 25, further comprising treating the subject with the selected 

treatment. 

29. The method of claim 1 1 , wherein con^aring the patterns of logarithmic expression 
ratios conqjrises comparing the logarithmic expression ratios to patterns of logarithmic expression 
15 ratios in a database of patterns associated with jBi2G4/-like, MG42-like or non-BRCA-like ovarian 
tumors. 



30. The method of claim 1 1, wherein comparing patterns of logarithmic expression 
ratios of the plurality of markers comprises obtaining the pattern of expression of the plmality of 

20 markers on an array. 

31. The mediod of claim 1, wherein the pattern of expression of the plurality of . 
markers conq)rises over-e?qpression of one or more markers compared to flie standard. 

25 32. . The method ofclaim 29, wherein the one or rnoreinaikers that is overexpressed is 

listed in Table 5. 



33 . The method of claim 32, wherein determining the pattern of expression comprises 
providing nucleic acid sequences of the markers, and performing nucleic acid hybridization of 

30 specific oligonucleotide probes to the nucleic acid sequences. 

34. The method of claim 33, wherein the sequence of the oUgonucleotide probe is 
selected to bind specifically to a nucleic acid molecule listed in Table 1. 

35 35. The method ofclaim 34, flirther comprising an^lifying &e one or more markers 

prior to performing nucleic acid hybridization. 

36. The method ofclaim 33, furtiher con^rising quantitating hybridization to detect a 
level of differential e}q)ression. 
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37. The method of claim 33, wherein providing sequences of die markers conq)rises 
providing the nucleic acid sequences on an array carrying the phirality of markers. 

S 38. The method ofclaim 37, wherein the array is a cDNAmicroarray. 

39. The method ofclaim 33, wherein providing the nucleic acid sequences of the 
markers comprises providing at least SO of the markers listed in Table 1. 

10 40. The method of claim 33, wherein providing the nucleic acid sequences of the 

markers conqprises providing at least 100 of the naarkers listed in Table 1 . 

4 1 . The mefliod of claim 33, wherein providing the nucleic acid sequences of the 
markers comprises providing at least 200 of the markers listed in Table 1. 

15 

42. A method of diagnosing or prognosing development or progression of ovarian 
cancer in a subject comprising detecting under-expression of one or more markers in Table 4 relative 
to a standard. 

20 43 . The method of claim 42, wherein the standard is immortalized ovarian epithelial 

cells, ovarian tissue from a subject not having cancer or a subject not predisposed to developing 
cancer, or ovarian tissue from the subject at an earlier point in time. 

44. The method of claim 42 wherein the one or more markers conoprise a nucleic add 
25 . encoded by SEQ-^EDNOs: 449-503. 

45. A method of diagnosing or prognosing development or progression of ovarian 
cancer in a subject conqmsing detecting over-e;q)ression of one or more markers in Table 5 relative 
to a standard. 

30 

46. The method ofclaim 45j wherein the standard is immortalized ovarian epithelial 
cells, ovarian tissue from a subject not having cancer or a subject not predisposed to developing 
cancer, or ovarian tissue from tiie subject at an earlier point in time. 

35 47. The method of claim 45 wherein the one or more markers comprise a nucleic acid 

encoded by SEQ ID NOs: 18-19. 30-31, 50-51, 52-54, 55-57, 58-59, 60, 68-69, 74-76, 85-86, 87-88, 
89-91, 92-93, 94-95, 97-99, 122-123, 133-135, 149-151, 164-166, 167-168, 169-170, 174-175. 176- 
178, 179-180, 181-182, 190-192, or 199-201. 
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48. A method of screening for an agent for treating or inhibiting ovarian cancer in a 
subject, con^rising exposing a tumor cell to a therapeutically effective amount of a pharmaceutical 
confound that restores wUd-type expression of at least one £JtC47-Iike or SJ2C12-like marker listed 
in Table 1. 

5 

49. The method of claim 48 wherein the agent corrects under-e3q;)ressioa or over- 
expression of a marker listed in Table 1. 

50. A method of monitoring a response to therapy for an ovarian tumor, con9>rising 
10 monitoring eTcpression of the markers in the subject following adnoinisfration of the therapy. 



51. A method of diagnosing or prognosing development or progression of ovarian 
cancer in a subject conq)rising detecting differential expression of a gene that maps to Chromosome 
Xpll.2. 

15 

52. A kit for classifying one or more ovarian tumors as sporadic, BRCAI-Uke or 
£/^C42-like tumors, con^rising con^onents for measuring e^qpression levels of markers in the one or 
more ovarian tumor samples and for con^aring the expression levels of &e markers to Hie markers in 
Table 10. 

20 

53. The kit of claim 52, wherein the e;q)ression levels of a plurality of markers from 
each tumor are measured. 

54. The kit of claim 52, cont^rising an array carrying a plurality of markers. 
25 ■ — • - 

55. The kit of claim 52, comprising a binding molecule that selectively binds to a 
marker in the one or more tumor samples, and wherein the marker is listed in Table 10. 



56, The kit of claim 52, wherein the expression levels measured are of a non-BRCA- 
30 like, ^i?C4i-like or BRCA2'Uk.e tumor protein, and the binding molecule is an antibody or antibody 

fragment that selectively binds the tumor protein. 

57. The kit of claim 52, wherein the expression levels measured are of a BRCA-like, 
BRCAI-iikG or BRCA2Ajke nucleic acid marker, and the binding molecule is an oligonucleotide 

35 capable of hybridizing to tiie nucleic acid molecule mark^. 
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Figure 1 
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Figure 2A 
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Figure 2A' 
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Figure 2C 




BRCAl-like gene BRCA2-like gene 

expression expression 
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Figure 3A 
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Figure 3A' 
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Figure 3B 




Z -2.5 -1.5 -0.5 0 0.5 1.5 2.5 



wo 03/0680S4 



9/60 



PCT/US03/04688 




Z -2.5 -1.5 -0.5 0 0.5 1.5 2.5 



wo 03/068054 



10/60 



PCT/US03/04688 



Figure 3C 

DESIGNATIONS IN FIGURE 3-A 



Designation 


Across (tumor types) 


Down (genes) 


SEQ ID NO: 


1 




PAK2 


557-558 


2 


15/- 10 


NCSTN 


678 


3 




HGF 


585-586 


4 




BAD 


co^ coo 

587-588 


5 


XIO OA 


F23149 


766-767 


6 


■DO 01 


DKFZF564C186 . 


763-764 


7 


tjo 00 


UBLl 


510-512 


8 


"DO 01 


GCAT 


741-742 


9 


ttO OA 


JiBBF4 


691-692 


1 A 

10 




AT TT 
CALU 


0'74r 

375 


11 


no 7 


Kills XI 


CIO CO A 

518-320 


12 


PO_/| 
X3Z-*f 


rlK2B 


624-625 




B2-6 


rur JL 1 


DD 1-ODZ 


14 


B2-7 


1LI8R1 


618-620 


15 


B2-8 


P14L 


743-744 


16 


B2-9 


RALY 


712-713 


17 


B36 


KIAA0218 


703-704 


18 


B39 


MPI 


666-667 


19 


B40 


IL17R 


. 583-584 


20 


B41 


KIAA0008 


745-746 


21 


B52 


ILIB 


775-776 


22 


B54 


RAB3A 


739-740 


23 


B55 


HARS 


662-663 


24 


B60 


TUFM 


659 


25 


B61 


PEF 


719-720 


26 


B62 


GNB2 


674-675 


27 


B63 


SECRET 


655-656 


28 


B64 


SLC9A1 


657-658 


29 


B70 


NAGA 


653-654 


30 


B74 


MNATl 


660-661 
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Figure 3C, continued 



DESIGNATIONS IN FIGURE 3-A 



Designation 


Across (tumor tYDes") 


Down r^ene^^ 


SFO m fid' 


31 


B77 


EST 




32 


B78 


COVAl 


788-7RQ 


33 


B79 


LOX 




34 


B80 


MAPREl 


760-761 

/ / U 1 


35 


ClOO 


FU22059 


770-771 


36 


CI 02 


ILK 




37 


C103 


PISD 




38 


C105 


PPM 




39 


C107 


EIF4A1 


664-665 


40 


Clio 


KIAA0144 


705-706 


41 


C117 


TCEB2 


548-550 


42 


C118 


GART 


600-602 


43 


€77 


TAGLN2 


668-669 


44 


C84 


UBEl 


533 


45 


C85 


FU12442 


701-702 


46 


C99 


PPY2 


733-734 


47 


C79 


MAP2K3 


626-628 


48 


- 0123 


GTPBPl 


697-698 " 


49 


C87 


NM23-H1 


551-553 


50 


C95 


SF3B4 


777 


51 


cm 


AKTl 


504-506 


52 


CI 14 


PPP2R5A 


621-623 


53 


C15 


APMCFl 


731-732 


54 


C16 


ZNF173 


589-591 


55 


C17 


GS2NA 


699-700 


56 


CI 


AFP 


lZ6-m 


57 


C20 


SLC25A11 


544 


58 


C41 


PPPICB 


597-599 


59 


C42 


RBBP2 


792-793 


60 


C45 


SCYB5 


554-556 


61 


C49 


S100A4 


559-561 
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Figure 3D 



DESIGNATIONS IN FIGURE 3-B 



Designation 


Across (tumor type) 


Down (gene) 


SEQIDNO: 


62 


B2-20 


KIAA0365 


676-677 


63 


B2-21 


SFRSll 


682 


64 


B2-22 


KDR 


615-617 . 


65 


B2-23 


SCYA4 


612-614 


66 


B2-24 


SCYA4 


612-614 


67 


B2-25 


RGSl 


398 


68 


B2-4 


RGSl 


398 


69 


B2-6 


RGS16 


594-596 


70 


B2-7 


RGS16 


594-596 


71 


B2-8 


SFRP4 


515-517 


72 


B2-9 


ENPPl 


603-605 


73 


B36 


PDGFRB 


580-582 


74 


B39 


BMP6 


687-688 


75 


B40 


MMP13 


606-608 


76 


B41 


CSRP2 


709 


77 


B52 


WNT2 


797-799 


78 


B54 


WNT2 


797-799 


79 


B55 


APEX 


790-791 


80 


B60 


P0LR2A 


693-694 


81 


B61 


GOLGAl 


782-783 


82 


B62 


CSNKIE 


785 


83 


B63 


LOC51605 


759 


84 


B64 


ZNF211 


757-758 


85 


B70 


FOXOIA 


707-708 


86 


B74 


ZFP161 


592-593 


87 


B77 


ATP7A 


689-690 


88 


B78 


FLJ21940 


768-769 


89 


B79 


TNRCn 


784 
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Figure 3D, continued 

DESIGNATIONS IN FIGURE 3-B 



Designation 


Across (tumor type) 


Down (gene) 


SEQ ID NO: 


OA 


B80 


nt ATI 

TALI 


521-523 


01 
yl 


ClOO 


NCOAl 


565-567 




C102 


BRE 


710-711 


93 


C103 


RAB2L 


780-781 


94 


C105 


SAST 


723-724 


95 


C107 


ITGAE 


571-573 


96 


Clio 


ARHGEF6 


679-681 


97 


C117 


TCF4 


779 


no 
9o 


C118 


TMEPAI 


755-756 


OA 

99 


C77 


CD36 




1 AA 
100 


C85 


PTEN 


507-509 


101 


POO 




//4 


102 


C79 


CD83 


562-564 


103 


C123 


FIJ10701 


762 


104 


C87 


LOC51760 


534-535 


105 


C95 


SMGl 


695-696 


106 


cm 


WNT2 


797-799 


107 


CI 14 


IL7 


574-576 


108 


C15 


CRBl 


735-736 


109 


C16 . 


GABRP . 


685-686 


110 


C17 


PLXNA2 


727-728 


111 


CI 


RNAC 


319 


112 


C20 


CUGBPl 


683-684 


113 


C41 


PONl 


670-671 


114 


C42 


RYBP 


568-570 


115 


C45 


CD36 


577-579 


116 


.•C49 


FU21661 


737-738 
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Figure 4A-C 



A 
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Figure 4A-C' 
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Figure 5A 
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Figure 5B 
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Figure 6A 
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Figure 6B 



1.8 
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ADDiSNDUM 

Table 1. Markers that were Differentially Expressed in a cDNA Microarray 
Expression Profile of Sixty-One Ovarian Cancer Tumors 



SEQ 
ID 
NO. 


Gene 


lAfAGE 
ID No. 


UniGene 
No. 


GenBank 
Accession 
No. 


Gene Description 


16- 
17 


BCKDHB 


770835 




AA427739; 
AA434304 


Branched chain keto 
acid dehydrogenase 
El, beta polypeptide 
(maple syrup mine 
disease) 


18- 
19 


SERPINF 
2 


82195 




T68859; 
T68934 


serine (or cysteine) 
proteinase inhibitor, 
clade F (alpha-2 
antiplasmin, pigment 
epithelium derived 
factor), mem 


20- 
22 


ZNF33A 


346902 




D31763; 
W78164; 
W79207 


KIAA0065 


23- 
24 


ZNF33A 


246543 




N57658; 
N77515 


Zinc finger protein 33a 
(K0X31) 


25 


EST 


192198 




H41144 


Unknown 


26- 

27 


EST 


128738 
• 




R16726; 
R09980 


Homo sapiens 
cDNA:FU23371 fis, 
clone HEPl 6068, 
highly similar to 
HSTPnSHHomo 
sapiens mRNA for trar 


28- 
29 


EST 


429211 




AA007283; 
AA007282 


ESTs 


30- 
31 


FU22174 


295939 




N67034; 
W04283 


hypothetical protein 
FLJ22174 


32- 
33 


EST 


415562 




W80701; 
W78802 


Unknown 


34- 

35 


EST 


296488 




N70208; 
W01059 


Unknown 


36- 
37 


EST 


120124 




T95064;. 
T95160 


ESTs 


38- 
39 


EST 


132142 




R26164: 
R23610 


Homo sapiens 
cDNA:FU21587 fis, 
clone COL06946 


40 


EST 


50635 




H17921 


ESTs 


41- 
43 


POR 


234180 




S90469; 
H70626; 
H66247 


Cytochrome P450 
reductase 


44- 
45 


CLU 


725877 




AA292226' 
AA292410 


wiUdlCJLiNUl V ^^-J^PiCUldll 

lysis inhibitor, SP- 
40,40, sul&ted 
glycoprotein 2, 
testosterone-repressed 
prostate messenger 


46- 
47 


EST 


73702 




T54544; 
T54585 


Unknown 


48 


EST 


2218314 




AI744768 


Unknown 


49 


. EST 


2261113 




AI609063 


EST 



1 



22/60 



50- 
51 


IFITMl 


755599 




AA419251; 
AA419286 


Interferon induced 
transmenibiaoe {notein 
1 (9-27) 


52- 
54 


HTIMl 


509641 




J04164; 
AA058323; 
AA058453 


Interferon-inducible 
protein 9- 

27<>intetferon-induced 

17kDameinbiane 

protein 


55- 
57 


IFITM2 


624655 




X57351; 
AA187365; 
AAl 87099 


laterfeion-indiiced 
protein 1-8D 


58- 
59 


IFrrM2 


376520 




AA041402; 
AA041501 


Interferon-indaced 
protein 1-8D 


60 


HE4 


786675 




AA451904 


Epididymis-specific, 
whey-acidic protein 
type, four-disulfide 
core; putative ovarian 
carcinoma marker 


61- 
62 


KIAA020 
3 


61008 




T40715; 
T39659 


KIAA0203gene 
product 


63- 
64 


IL8RB 


882183 




M73969; 
AA480683 


CXCR2=IL-8 
Receptor beta 


65- 
67 


VDUPl 


297954 




S73591; 
N68956; 
W00656 


Brain-expressed 
HHCPA78 
hoinolog=Induced in 
HL60 cells treated 
with vitamin D or 
cyclohexinaide 


68- 
69 


G1P3 


782513 




AA448478; 
AA432030 


laterfeion, alpba- 
inducible protein 
(clone IFI-6-16) 


70- 

72 


BST2 


811024 




D28137; 
AA485371; 
AA485528 


Bone marrow stromal 
cell antigen 2 


73 


SIATl 


897906 




AA598652 


Sialyltransferase 1 
(beta-galactoside 
alplia-2y6- 
sialyltransferase) 


74- 
76 


DDRl 


182288 




U48705; 
H41900; 
H41939 


Receptor protein- 

tyrosineldnase 

EDDRI 


77 


PLl 


433573 




AA701655 


Human endogenous 
retrovirus envelope 
region mRNA (PLl) 


78- 
79 


EST 


108422 




T77847; 
T77926 


Homo sapiens, clone 
MCX:: 12275, mKNA, 
complete cds 


80 


CEBPG 


455121 




AA676804 


CCAAT/enhancer 
binding protein 
(C/EBP), gamma 


81- 
82 


MUCl 


840687 




AA488073; 
AA486365 


Mucin 1, 
transmembiane 


83- 

84 


CP 


223350 




H86554; 

H86642 


ceruloplasmin 
(ferroxidase) 


85- 
86 


HLA- 
DRBS 


811139 




AA485739; 
AA486460 


Major 

liistocompatibility 
conq)lex, class n, DR 
betas 


87- 


HLA- 


417711 




W88967; 


Major 
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8S 


DRBl 






W88546 


histoconaqpatibility 
co]iq>lex, class II, DR 
betal 


89- 
91 


CD74 


725751 




X00497; 
AA399225; 
AA292218 


Invariant cbaii^la- 
associated invariant 
gamma-chain 


92- 
93 


CD74 


840681 




AA488071; 
AA486363 


Invariant chain=la- 
associated invariant 
gamma-chain 


94- 
96 


HLA- 
DRA 


117411 




K01171; 
T89719; 
T89816 


MHC Class n=DR 
alpha 


97- 
99 


HLA- 
DPA 


207715 




X00457; 
H62294; 
H62293 


MHC Class IIbDP 
a^ha 


100 


HLA- 
DRBl 


855547 




AA664195 


Major 

histoconq>atibility 
complex, class II, DR 
beta 1 


101- 
103 


HLA- 
DRBl 


470953 




M20430: 
AA032179; 
AA033653 


MHC Class n=DR 
beta 


104- 

106 


TNFSFIO 


203132 




U57059; 
H54629; 
H54628 


TRAILr=Apo-2 ligand 


107- 
109 


H2AFL 


429091 




U90551; 
AA007585; 
AA007574 


Histone-2A-like 
protein (H2Ayi) 


110- 
111 


IG 
lambda 


66560 




T67053; 
T67054 


Immunoglobulin 
lambda locus 


112- 
114 


IGKC 


159142 




M63438; 
R76324; 
R76323 


Immunoglobulin 
kappa light chain 


115- 
116 


IGKC 


840451 




AA485725; 
AA485862 


Immunoglobulin 
kappa light chain 


117- 
119 


RAD23A 


293925 




AF004230; 
N63943; 
N98412 


MIR- 

7=Tnonocyte/macropha 
ge Ig-related recq)tor 
AND RAD23=UV 
excision repair protein 
(Double hit) 


120- 
121 


SCYBIO 


967284 




X02530; 
AA527139 


IP-10 


122- 
123 


RGSl 


361323 




AA017544; 
AA017417 


regulator of G-protein 

signaling 1 


124- 
126 


RGSl 


686248 




S59049; 
AA262268; 
AA262879 


BL34=RGSl=regulato 
r of G-protein 
signaling which 
inhibits SDF-1 
directed B cell 
migration 


127- 
129 


GASl 


365826 




L13698; 

AA025819; 
AA025884 


Growdi anest-specific 
1 


130- 
132 


BTG2 


358214 




U72649; 
W95415; 
W95512 


BTG2=T)53 dependent 
inducible anti- 
proliferative gene 
homologous to 
Pc3Aris21 immediate 
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early genes 


133- 
135 


FOS 


755279 




V01512; 
AA496353; 
AA496403 


c-fos 


136 


LSR68 


1862044 




AI053597 


lipopolysacchaiide 
specific respoiise-68 

protein 


137- 
138 


JUNB 


309864 




N94468; 
W23847 


Jun B proto-oncogene 


139- 
140 


JUNB 


122428 




T99236; 
T99280 


Jim B proto-oncogene 


141- 
143 


C0L3A1 


122159 




X14420; 
T98612; 
T98611 


Collagen Type in 
Alpha 1 


144- 
145 


LUM 


813823 




AA447781; 
AA453712 


lumican 


146- 
148 


EST 


294506 




U90916; 

N71007; 
W01902 


clone 23815 mRNA 


149- 
151 


SORLl 


279388 




.Y08110; 
N48698; 
N45548 


Mosaic protein 

LRl l=liybrid receptor 

gp250 precursor 


152- 
153 


KNASE6 
PL 


712341 




AA405000; 
AA281840 


Ribonuclease 6 
precursor 


154- 
156 


HLA-B 


769753 




M28205; 
AA429012; 
AA429162 


Hunaan Leukocyte 
Antigen B 


157- 
159 


HLA-C 


810142 




Ml 1886; 
AA464246; 
AA464354 


Human Leukocyte 
Antigen C 


160- 
161 


SPONl 


46173 




H09099; 
H09449 


Spondin 1, (f-spondin) 
extracellular matrix 
protein 


162- 
163 


HSRNAS 
EB 


814526 




AA459363; 
AA459588 


RNA-binding region 
(RNPl.RRM) 
containing 1 


164- 
166 


ABCSl 


813256 




M14758; 
AA455911; 
AA456377 


MDRl=Multidrug 
resistance protein 1=P- 
glycoprotein 


167- 
168 


ZFP36 


23804 




R38383; 
T77499 


Zinc finger protein 
homologous to Z^36 
in mouse 


169- 
171 


ZFP36 


135880 




M63625; 
R33813; 
R33812 


TTP=tristetraproIine= 
GOS24=zinc finger 
transcriptional 
regulator 


172- 
173 


ZFP36 


727266 




AA411987; 
AA402178 


TTP=tristetraproline= 
GOS24=^c finger 
transcriptional 
regulator 


174- 
175 


HGF 


41650 




R52798; 
R52797 


hepatocyte growth 
factor (bepapoietin A; 
scatter factor) 


176- 
178 


SGK 


840776 




AJ000512; 
AA486082; 
AA486140 


sgk=putative 
serine/Oireonine 
protein kinase 
transcriptionally 
modified during 
anisotonic and isotonic 



4 
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alteration 


179- 
180 


PPP1R7 


814508 




AA459351; 
AA459572 


Protein phosphatase 1, 
regulatory subunit 7 


181- 
182 


CD24 


204335 




H59916; 
H59915 


CD24 antigen (small 
cell lung carcinoma 
cluster 4 antigen) 


183- 
184 


TPD52 


814306 




AA459100; 
AA459318 


Tumor protein DS2 


185- 
187 


CXCR4 


79629 




X71635; 
T62636; 
T62491 


CXC chemokine 

receptor 

4=fusin?Tneuropeptide 
Y Teceptor=13 


188- 
189 


JUND 


767784 




AA418670; 
AA418676 


Jun D proto-oncogene 


190- 
192 


BRF2 


485770 




U07802; 
AA039882; 
AA039967 


Tislld=ERF- 
2=growth factor early 
response gene 


193- 
195 


A2M 


377647 




M11313; 
AA055995; 
AA055907 


Alpha-2- 
macroglobulin 


196 


EST 


1384797 




AA856938 


Homo sapiens noRNA; 
cDNA 

DKFZp434O0227 
(&om clone 
DKFZp434O0227) 


197- 
198 


CD24 


196519 




S75311; 
R91610 


CD24 


199- 
201 


TLR3 


144675 




U88879; 
R76099; 
R76150 


TLR3= Toll-like 
receptor 3 


202 


ITM2A 


878596 


Hs.17109 


AA775257 


integral membrane 
protein 2A 


203- 
204 


GATM 


42558 


Hs.75335 


R61229; 
R61228 


glycine 

amidinotransferase (L- 

arginine:glycine 

amidinotransferase) 


205- 
207 


RNASE4 


81417 


Hs.283749 


D37931; 
T60163; 
T60223 


ribonuclease L (2',5'- 

oligoisoadenylate 

synthetase-dependent) 


208- 
210 


LAMA2 


471642 


-Hs.75279 


- Z26653; 

AA034939; 

AA034938 


laTnitiin alpha 2 

(merosin, congenital 
muscular dystrophy) 


211 


PBX3 


448386 


Hs.294101 


AA778198 


pre-B-cell leukemia 
transcription factor 3 


212 


PLA2G6 


1472538 


Hs.120360 


AA872271 


phospholipase A2, 
groiq> VI (cytosoUc, 
calcium-independent) 


213 


SMARC 
A2 


814636 


Hs. 198296 


AA481026 


SWI/SNF related, 
matrix associated, 
actin dependent 
regulator of chromatin, 
subfamily a, meniber 2 


214- 
215 


CUGBP2 


488956 


Hs.211610 


AA047257; 
AA057142 


CUG triplet repeat, 
RNA-binding protein 
2 


216- 
218 


TGEBR3 


209655 


Hs.79059 


L07594; 
H62473; 
H61499 


TGF beta receptor 
typem 


219 


STAR 


859858 


Hs.3132 


AA679454 


steroidogenic acute 
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regulatory Piotein 


220 


GNGll 


1636447 


Hs.83381 


AA999901 


guanine nucleotide 
binding protein 11 


221 


CTTEDZ 


491565 


Hs.82071 


AAl 15076 


Cbp^300-mteracting 
transactivator, vnih 
Glu/Asp-rich carboxy- 
tenninal donoain, 2 


222 


CTNNAL 
1 


744647 


Hs.58488 


AA621315 


catenin (cadherin- 
associated protein), 
alpha-like 1 


223 


ABCA8 


743773 


Hs.38095 


AA634308 


ATP-binding cassette, 
sub-family A (ABCl), 
member 8 


224- 
226 


KLF4 


188232 


Hs.7934 


AF105036; 
H45668; 
H45711 


GKLF=EZF«KLF4=g 
ut-enriched Kruppel- 
like zinc finger 
protein=expressed in 
vascular endothelial 
cells 


227- 
228 


rrpRi 


471725 


Hs. 198443 


AA035450; 
AA035477 


inositol 1,4,5- 
triphosphate receptor, 
typel 


229- 
230 


MAF 


487793 


Hs.30250 


AA043501; 
AA044658 


v-maf 

musculoaponeurotic 
fibrosarcoma (avian) 
oncogene homolog 


231- 
232 


FOXCl 


768370 


Hs.284186 


AA495790; 
AA495846 


forkhead box CI 


233 


TCF21 


461351 


Hs.78061 


AA699782 


transcription £ictor 21 


234- 
236 


CCNI 


248295 


Hs.79933 


D50310; 
N58511; 
N78101 


Cyclinl 


237- 
238 


DCN 


209367 


Hs.76152 


H64138; 
H64086 


decorin 


239- 
240 


CBF2 


789049 


Hs. 184760 


AA452909; 
AA453077 


CCAAT-box-binding 
transcription factor 


241- 
242 


EST 


68049 


H8.180324 


T52830; 
T52829 


Homo sapiens, clone 
IMAGE:4183312, 
mRNA, partial cds 


122 


RNASE4 


81417 


Hs.283749 




ribonuclease L (2',5'- 

oligoisoadenylate 

synthetase-dependent) 


243 


SLC4A1 
AP 


2094012 


Hs.306000 


AI424433 


solute carrier family 4 
(anion exchanger), 
member 1, adapter 
protein 


244- 
245 


GSTM5 


377731 


Hs.75652 


AA056232; 
AA056231 


glutathione S- 

transferase M5 


246 


C4BPB 


460470 


Hs.99886 


AA677687 


complement 
component 4-binding 
protein, beta 


247- 
248 


HS3ST1 


73609 


Hs.40968 


T55714; 
T55756 


heparan sulfate 
(glucosamine) 3-0- 
sulfotransferase 1 


249 


CDKNIC 


2413955 


Hs.106070 


AI828088 


cyclin-dependent 
kinase inhibitor IC 
(p57.Kip2) 


250 


HNRPDL 


.897823 


Hs.170311 


AA598578 


heterogeneous nuclear 
ribonucleoprotein D- 
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like 


251 


CIRBP 


1558799 


Hs.l 19475 


AA977242 


cold inducible KNA- 
binding piotein 


252 


RGS2 


2321596 


Hs.78944 


AI675670 


regulator of G-protein 
signalling 2, 24kD 


253- 
254 


TCEALl 


786607 


Hs.95243 


AA478480; 
AA451969 


transcription 
elongation &ctor A 
(SID-Iikel 


255- 
256 


CAVl 


377461 


Hs323469 


AA055835; 
AA055368 


caveolin 1, caveolae 
protein, 22kD 


257 


ALDHIA 
1 


855624 


Hs.76392 


AA664101 


aldehyde 
dehydrogenase 1 

fsnnly, member Al 


258- 
259 


RBPMS 


343443 


Hs.80248 


W67200; 
W67323 


KNA-binding protein 
gene with multqple 
splicing 


260- 
261 


ADAMT 
SI 


62263 


H8.8230 


T41173; 
T40309 


a disintegrin-like and 
metalloprotease 
(reprolysin type) with 
thronibospondin type 
1 motif, 1 


262- 
263 


DSCRIL 
1 


51408 


Hs.156007 


H19440; 
H19439 


Down syndrome 
critical region gene 1- 
likel 


264 


DLKl 


436121 


Hs.169228 


AA701996 


delta-like homolog 
(Drosophila) 


265- 
266 


CDHll 


491113 


Hs.75929 


AA136983; 
AA137109 


cadherinll, type 2, 
OB-cadherin 

(osteoblast) 


138 


SGK 


840776 


Hs.296323 




sgk=putative 
serine/tfazeonine 
protein kinase 
transcriptionally 
modified during 
anisotonic and isotonic 
alterations of cell 
voliune 


267 


HFLl 


450060 


Hs.278568 


AA703392 


H factor 

(complement)-like 1 


268- 
269 


FOG2 


38347 


Hs.106309 


R49439; 
R35921 


Friend of GATA2 .. 


270 


C7 


898122 


Hs.78065 


AA598478 


complement 
component 7 


271- 

272 


SGCE 


784109 


Hs.l 10708 


AA432066; 

AA446750 


sarcoglycan, epsilon 


273 


NBLl 


898305 


Hs.76307 


AA598830 


neuroblastoma, 
suppression of 
tumorigenicity 1 


274- 
275 


HBB 


173385 


Hs.155376 


H20968; 
H210n 


hemoglobin, beta 


276- 
278 


CARP 


840683 


Hs.31432 


X83703; 
AA488072; 
AA486364 


Cytokine inducible 
nuclear protein 


279- 
280 


MTTF 


278570 


Hs.166017 


N66177; 
N99168 


microphthalmia- 
associated 
transcription &ctor 


281- 
282 


CDC20 


898062 


Hs.82906 


U05340; 
AA598776 


p55CDC 


283- 


EPS8 


148028 


Hs.2132 


U12535; 


epidennal growth 
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H13623; 
H13622 


factor receptor kinase 
substrate (Eps8) 


286- 

287 


ARHI 


345680 


Hs.194695 


W72033; 
W76278 


ras homolog gene 
family, menober I 


288 


B4-2 


857002 


Hs.75969 


AA669637 


pioline-rich protein 
with nuclear targeting 
signal 


289- 
291 


SELB 


186132 


H8.89546 


M30640i 
H39991; 
H39560 


ELAMl==^dothelial 
leukocyte adhesion 
molecule I 


292- 
293 


PMP22 


133273 


Hs. 103724 


R26732; 
R26960 


peripheral myelin 
protein 22 


294 


EBAF 


340657 


Hs.25195 


W56771 


endometrial bleeding 
associated factor (left- 
right determination, 
factor A; transforming 
growth factor beta 
superfamily) 


295- 
296 


PRKAR2 
B 


609663 


Hs.77439 


AA180007; 
AA181500 


protein kinase, cAMP- 
dependent, regulatory, 
type n, beta 


297 


NFKBIE 


1573311 


Hs.91640 


AA953975 


nuclear factor of kappa 
light polypeptide gene 
enhancer in B-cells 
inhibitor, epsilon 


299 








N36279 


v-jQi jiaroy- 
Zuckeiman 4 feline 
sarcoma viral 

r\Ti r* r\ cti^t\ ^ \y/\tY\f\\f\CT 


158 


JUNB 


309864 


Hs.198951 




jun B proto-oncogene 


300 


BCKDK 


1573108 


Hs.20644 


AA970731 


branched chain alpha- 
ketoacid 

dehydrogenase kinase 


301- 
303 


BTGl 


298268 


H8.77054 


X61123; 
N70463; 
W03824 


BTGl=B-ceU 

translocation gene 
l=anti-proliferative 


304- 
305 


AKAF12 


784772 


Hs.788 


AA478543; 
AA478542 


A kinase (PRKA) 
anchor protein 
(gravin) 12 


306- 
307 


NR4A2 


898221 


Hs.82120 


S77154; 
AA598611 


l«IOT«Enmediate early 

response 

protein=Nurrl 

homologue=Nurr77 

orphan steroid 

receptor family 

meniber 


SOS- 
SOP 


HBB 


126531 


Hs. 155376 


R06757; 
R06806 


hemoglobin, beta 


310- 
311 


ARHGAP 
6 


768489 


Hs250830 


AA495981; 
AA425035 


Rho GTPase 
activating protein 6 


312 


PLS3 


1568391 


Hs.4114 


AA953747 


plastin 3 (T isoform) 


313- 
314 


FNTA 


300015 


Hs.138381 


N78902; 
W06970 


famesyltransferase, 
CAAX box, alpha 


315- 
316 


TNFAIP3 


770670 


H8.211600 


AA476272; 
AA433807 


tumor necrosis factor, 
alpha-induced protein 
3 


317- 
318 


EGRl 


840944 


Hs.326035 


AA486533; 
AA486628 


early growth response 
1 


319 


RNAC 


795213 


Hs.l 13052 


AA453591 


RNA cyclase homoloe 
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320 


PA26 


813584 


Hs.14125 


• AA447661 


p53 regulated PA26 
oncleai protein 


321 


Clloifl3 


1573778 


Hs.72925 


AA970526 


chromosome 11 open 
reading frame 13 


322 


INGIL 


2169465 


Hs.107153 


AI564029 


inhibitor of growtli 
family, member 1-like 


323 


RPL9 


2577249 


Hs. 157850 


AW075605 


ribosomal protein L9 


324- 
325 


ADH5 


813711 


Hs.78989 


AA453776; 
AA4538S9 


alcohol dehydrogenase 
S(classni),chi 

polypeptide 


326- 
327 


FZD7 


298122 


Hs. 173859 


N69049; 
W00697 


frizzled (Drosopbik) 
homolog 7 


328- 
329 


MATN2 


366100 


Hs.19368 


AA071473; 
AA082338 


matrilin2 


330- 
331 


SLC11A3 


71863 


H8.5944 


T52564; 
T57235 


sohrte carrier &niily 
11 Qiroton-coitpled 
divalent metal ion 
transporters), member 
3 


332- 
333 


EST 


767641 


Hs.122460 


AA418293; 
AA418356 


ESTs 


334- 
335 


ERCC5 


292463 


Hs.48576 


N62586; 
N80359 


excision repair cross- 
conqjlementing rodent 
repair deficiency, 
coaq)lementation 
group 5 (xeroderma 
pigmentosmn, 
complementation 
group G (Cockayne 
symtome)) 


336- 
• 337 


MGC247 
9 


43771 


H8.79625 


H05655; 
H05654 


hypothetical protem 
MGC2479 


338- • 
339 


RPL21 


810617 


Hs.184108 


AA464743; 
AA464034 


ribosomal protein L21 


182 


CD24 


204335 


Hs.286124 




CD24 


340- 
341 


SLPI 


378813 


Hs^51754 


X04470; 
AA683520 


Secretory leukocyte 
protease inhibitor 


342 


SPPl 


378461 


Hs.313 


AA775616 


Secreted 
phosphoprotein 1 
(osteopontin, bone, 
sialoprotein I, early T- 
lymphocyte activation 

1) 


343- 

344 


BF 


741977 


Hs.69771 


L15702; 
AA401441 


B-factOT, properdin 


345- 
347 


CKSl 


810899 


Hs.334883 


X54941; 
AA459292; 
AA459522 


CDC28 protein kinase 
1 


348- 
349 


MMP7 


470393 


H8.2256 


AA031514; 
AA031513 


Matrix 

metalloproteinase 7 
^oatrilysin) 


350- 

351 


PAX8 


742101 


Hs.73149 


AA405767; 

AA405891 


Paired box gene 8 


352- 
353 


SPINT2 


814378 


Hs.31439 


AA458849; 
AA459039 


Serine protease 
inhibitor, Kunitz type, 
2 


354 


ZWINT 


451907 


Hs.42650 


AA706968 


ZWIO interactor 


355 


DGKH 


2544675 


Hs.159073 


AW052032 


Diacylglycerol kinase, 
eta 
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356 


HMGIY 


782811 


H8.139800 


AA448261 


HMGIY Higb- 
mobility gfoup 
(nonhistone 
chromosomal) protein 
isofoims I and Y 


357- 
359 


SDC4 


504763 


Hs.252189 


X67016; 
AA148737; 
AA148736 


Syndecan4 

(anqihiglycan, 

ryudocan) 


360 


.CDKN2A 


1161155 


Hs.1174 


AA877595 


Cyclin-dependent 

kinase inhibitor 2A 
(melanoma, pl6, 
inhibits CDK4) 


361- 
362 


SCNNIA 


810873 


Hs.2794 


AA458982; 
AA459197 


Sodium channel, 
nonvoltage-gated 1 
alpha 


363 


LDHA 


43550 


Hs.2795 


H05914 


Lactate dehydrogenase 
A 


364- 
365 


FOLRl 


131839 


H8.73769 


R24530; 
R24635 


Folate receptor 1 


366- 
367 


TPIl 


855749 


Hs.83848 


M10036; 
AA663983 


Triosephosphate 
isomerase 1 


368 


KLK8 


2514426 


Hs.104570 


AI963941 


KaUikrein8 
(neurqpsin/ovasin) 


200 


CXCR4 


79629 


H8.89414 




ChemoMne (C-X-C 
motif), receptor 4 
(fiisin) 


369- 
370 


KNSLl 


825606 


Hs.8878 


AA504625; 
AA504719 


Kinesin-like 1 


371- 
372 


H2AFO 


488964 


H8.795 


AA047260; 
AA057146 


H2A histone &mily, 
member O 


373- 
374 


HLA- 
DRA 

• 


153411 


H8.76807 


R47979; 
R48091 


Major 

histoconq)atibility 
complex, class U, DR 
alpha 


375 


CRIPl 


1323448 


Hs. 17409 


AA873604 


Cysteine-rich protein 1 


376 


PP 


950700 


Hs.184011 


AA608572 


pyrophosphatase 
(inorganic) 


377- 
378 


EST 


666391 




AA232895; 
AA232894 




379- 
381 


SLC2A1 


207358 


Hs.169902 


-K03195; 
H58873; 
H58872 


Solute carrier family 2 
(facilitated glucose 
transporter), member 1 


382 


EST 


897770 




AA598508 




383- 
385 


HDGF 


813673 


Hs.89525 


D16431; 
AA453749; 
AA453831 


Hepatoma-derived 
growth factor (high- 
mobility group protein 
l-like) 


386 


ASS 


882522 


Hs.160786 


AA676466 


Argininosuccinate 
synthetase 


387- 
388 


CLDN4 


770388 


H8.5372 


AA430665; 
AA427468 


Claudin4 


389 


FRAME 


897956 


Hs.30743 


AA598817 


preferentially 
expressed anttgea in 
melanoma 


390- 
391 


PTPRF 


897788 


Hs.75216 


Y00815; 
AA598513 


Protein tyrosine 
phosphatase, receptor 
type,F 


392- 
393 


EYA2 


741139 


Hs.29279 


AA402754; 
AA402207 


Eyes absent 
(Drosophila) homolo^ 
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2 


394- 
396 


MYCLl 


138917 


Hs.92137 


. M19720; 
R62813; 
R62862 


v-myc 

myelocytomatosis 
viial oncogene 
homolog 1 


397- 
399 


STATl 


840691 


Hs.21486 


M97935; 
AA488075; 
AA486367 


Signal transducer and 
activator of 
transcription 1 


400- 
401 


MTCH2 


564492 


Hs.279609 


AA121668; 
AA121740 


mitochondrial carrier 
homolog 2 


402 


HTR3A 


435597 


Hs2142 


AA703169 


5-hydroxytr)ptamine 
(serotonin) receptor 
3A 


403- 
404 


CCNEl 


68950 


Hs.9700 


T54121; 
T54213 


CyclinEl 


405 


CDH6 


739155 


Hs.32963 


AA421819 


Cadheiin6,type2,K- 
cadherin 


406- 
408 


PRKAQl 


531028 


Hs.3136 


U42412; 
AA070495; 
AA070381 


Protein kinase, AMP- 
activated, gamma 1 
non-catalytic subunit 


409 


DEFBl 


2403485 


Hs.32949 


AI769855 


Defensin, beta 1 


410- 
411 


ARPCIB 


626502 


Hs.11538 


AA188179; 
AA188155 


Actin related protein 
2/3 conq>lex, sobunit ' 
IB (41 kD) 


412- 
414 


PRKCI 


71622 


Hs.1904 


L33881; 
T57875; 
T57957 


Protein kinase C, iota 


415 


GAPD 


1610448 


Hs.169476 


AA991856 


glyceraldehyde-3- 

phosphate 

dehydrogenase 


416- 
417 


C2 


85497 


Hs.2253 


T71879; 
T71878 


Ck>niq)lement 
Component C2 


418- 
419 


H2AFY 


843075 


Hs.75258 


AA488627; 
AA486003 


H2A histone family, 
member Y 


420- 
421 


TM4SF1 


840567 


Hs.3337 


AA487893; 
AA488005 


Transmembrane 4 
superfanuly member 1 


422- 
423 


GAPD 


50117 


Hs.169476 


H16958; 
H16957 


glyceraldehyde-3- 

phosphate 

dehydrogenase 


424- 
426 


IFITM3 


809910 - 


Hs.182241 


X57352; - 
AA464417; 
AA464416 


interferon induced 
transmembrane protein 
3 (1-8U) 


427- 
428 


GLDC 


248261 


Hs.27 


N58494; 
K78083 


Glycine 
dehydrogenase 
(decarboxylating; 
glycine decarboxylase, 
glycine cleavage 
system protein P) 


429- 
430 


CALU 


144881 


Hs.7753 


R78586; 
R78585 


Calumenin 


431- 
432 


HBA2 


208764 


Hs.272572 


H63096; 
H63182 


Hemoglobin, alpha 2 


433 


SlOOAll 


810612 


Hs.256290 


AA464731 


SlOO calcium-binding 
protein All 
(calgizzarin) 


434- 
436 


LDHA 


897567 


Hs.2795 


X02152; 
AA497029; 
AA489611 


Lactate dehydrogenase 
A 


437 


UBE2C 


769921 


Hs.93002 


AA430504 


Ubiquitin-conjugatin^ 
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enzyme E2C 


438- 
440 


E2F3 


304908 


Hs.1189 


D38550; 
N92519; 
W3884I 


E2F transcription 
factors 


441- 
442 


CDHl 


251019 


Hs. 194657 


Z13009; 
H97778 


Cadherin 1, type 1, E- 
cadherin (epithelial) 


443- 
444 


PSME2 


210405 


Hs.179774 


H65395; 
H65394 


Proteasome (prosome, 
macropain) activator 
sabunit 2 (1PA28 beta) 


445- 
447 


BMP7 


344430 


Hs.170195 


X51801; 
W73473; 
W73527 


bone moiphogenetic 
protein 7 (osteogenic 
protein 1) 


448 


T0P2A 


825470 




AA504348 


topoisomerase (UNA) 
n alpha (170kD) 


449- 
451 


IL8 


328692 


Hs.624 


M17017; 
W45324; 
W40283 


interleuldn 8 


452- 
453 


GROl 


324437 


Hs.789 


X54489; 
W46900 


GROl oncogene 
(melanoma growth 
stimulating activity, 
alpha) 


454- 
456 


ALDHIA 
3 


272686 


Hs.75746 


U07919; 
N32289; 
N44575 


aldehyde 
dehydrogenase 1 
fandly, member A3; 
Aldehyde 
dehydrogenase 6 


457- 
459 


MMPl 


624924 


Hs.83169 


X54925; 
AA181875; 
AA186634 


matrix 

metaUoproteinase 1 

(interstitial 

collagenase) 


460- 
461 


osaF-2 


897910 


Hs.136348 


D13666; 
AA598653 


osteoblast specific 
factor 2 (fasciclin I- 
like) 


462- 
464 


CDC25B 


48398 


Hs.153752 


S78187; 
H14343; 
H14392 


cell division cycle 
25B; M-phase inducer 
phosphatase 2 


465- 
467 


FLNA 


487418 


H8.195464 


X53416; 
AA046721; 
AA046606 


filamin A, alpha 
(actin-binding protein- 
280) 


468- 
469 


TFP12 


726086 


Hs.295944 


AA399473; 
AA293402 


tissue .£ictor pathway 
inhibitor 2 


470- 
472 


FGF2 


23073 


Hs^84244 


M27968; 
R38539; 
T75110 


fibroblast growth 
factor 2 (basic) 


473- 
475 


CD44 


713145 


Hs.169610 


X56794; 
AA283090; 
AA282906 


CD44 antigen; 
extracellular matrix 
receptor- 
in=Hyaluronate 
receptor 


476- 
477 


DYTl 


69046 


Hs.19261 


T54320; 
T53726 


dystonia 1, torsion 
(autosomal dominant; 
torsin A) 


478 


vasLi 


878833 


Hs.76118 


AA670438 


ubiquitin carboxyl- 
terminal esterase LI 
(ubiquitin 
Molesterase) 


479- 
480 


PLAU 


1696513 


Hs.77274 


D00244: 
AI088434 


plasminogen activator, 
urokinase 


256 


LDHA 


43550 


Hs.2795 




lactate dehydrogenase 
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A 


481- 
483 


PTGS2 


147050 


Hs.196384 


U04636; 
R80217; 
R80322 


cyclooxygenase-2; 
prostaghmdin 
endoperoxide 
syii11use-2 


484- 
486 


PRNP 


682013 


Hs.74621 


M13899; 

AA256322; 
AA256449 


prion protein 


487- 
488 


MTIX 


297392 


Hs.278462 


N80129: 
W03653 


metallothionein IL, 
metalloUiionem IX 


489- 
490 


UGB 


81336 


Hs.2240 


T63761; 
T63800 


uteroglobin 


491- 
493 


PBEF 


594539 


Hs.239138 


U02020; 
AA169813; 
AA171651 


pre-B-cell colony- 
enhancing factor 


494- 
496 


TXNRDl 


789376 


Hs.13046 


X91247; 
AA464849; 

AA453335 


thioredoxin reductase 
1; ORIM-12 


497- 
499 


NTS 


21655 


Hs.153952 


X55740; 
T65120; 
T65189 


5' nucleotidase (CD73) 


500- 
502 


MT2A 


590150 


Hs.l 18786 


J00271; 
AA156031; 
AA156201 


metallothionein 2A 


503 


ZNF220 


949928 


Hs.82210 


AA599173 


zinc finger protein 220 


504- 
506 


AKTl 


810331 


Hs.71816 


U97276; 
AA464152; 
AA464217 


BPQF-l=bone-derived 
growth &ctor; v-akt 
murine thymoma viral 
oncogene homolog 1 


507- 
509 


PTEN 


322160 


Hs.10712 


U92436; 
W37864; 
W37855 


MMACl=PTEN=Tum 
or siq>pressor gene at 
10q23.3tiiatis 
Mutated in Multiple 
Advanced 

Cancers=Pho8phatase 
and trasin homolog 


510- 
512 


UBH 


758495 


Hs.81424 


U83117; 
AA401634; 
AA401864 


ubiquitin-homology 
domam protein PICl 


513- 
514 


WNT2 


302286 


Hs.89791. 


N78828; 
W17194 


. wingless-type MMTV 
integration site femily 
member 2 


515- 
517 


SFRP4 


841282 


Hs.105700 


AF026692; 
AA487193; 

AA486838 


fiizzled related protein 
fipHE 


518- 
520 


RUNXl 


263251 


Hs.129914 


D43968; 
H99599; 
H99598 


AMLlProto- 
oncogene 


521- 
523 


TALI 


71727 


Hs.73828 


X51990; 
T51236; 
T51350 


T-cell acute 
lynq)hocytic leukemia 
1 


524- 
526 


WAS 


236282 


Hs.2157 


U12707; 
H61193: 
H62098 


Wiskott-Aldrich 
syndrome protein 


527- 
528 


PCTKl 


713382 


Hs.171834 


X66363; 
AA28312S 


PCTAIREl 
serine/threonine 
protein kinase 


529 


EBP 


295986 


Hs,75105 


N67038 


emopamil-binding 
protein (sterol 
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isomeiase) 


530 


SMCILI 


897997 


Hs.211602 


AA598887 


SMCl (stiuctuial 
maintenance of 
chromosomes 1, 
yeast)-]ike 1 


531- 
532 


ARAFl 


207618 


Hs.77183 


H59758; 
H59757 


v-raf murine sarcoma 
361 1 viral oncogene 
homolog 1 


533 


XJBEl 


898262 


Hs.2055 


AA598670 


ubiquitin-activating 
enzyme El 


534- 

535 


LOC5176 

0 


52226 


Hs.26971 


H23265; 
H23376 


B/K protein 


536- 
537 


LRPAPl 


842785 


Hs.75140 


AA486209; 
AA486313 


low density 

hpoprotein-related 

protein-associated 

protein 1 (alpIia-2- 

macroglobulin 

receptor-associated 

protein 1) 


538- 
540 


PSTPIPl 


71434 


Hs.129758 


L07633; 
T47815; 
T47814 


interferon-gamma lEF 
SSP 5111; Interferon 
gamma upregulated 
protein 


541- 
542 


IDH2 


869375 


Hs.5337 


X69433; 
AA679907 


isocitrate 
dehydrogenase 2 
(NADP+), 
mitochondrial 


543 


SIAHBPl 


85469.6 


Hs.74562 


AA630094 


fuse-binding protein- 
interacting repressor 


544 


SLC25A1 
1 


878413 


Hs.184877 


AA670357 


solute carrier family 
25 (mitochondrial 
carrier; oxoglutarate 
carrier), member 11 


545- 
547 


LCN2 


302127 


Hs.204238 


X99133; 
N79823; 
W38398 


lipocalin 2 (oncogene 
24p3) 


548- 
550 


TaEB2 


52162 


Hs.172772 


L42856; 
H22966; 
H24146 


ElonginB=RNA 
polymerase U 
transcrq)tion &ctor 
SinplSsubumt 


551- 
553 


NM23H1 


176482 




X17620; 

H42520; 
H43520 


nm23-Hl"NDP kinase 

A=Nucleoside 
dephophate kinase A 


554- 
556 


SCYB5 


198699 


Hs.89714 


X78686; 
R95077; 
R95145 


ENA78'-chemokine 


557- 
558 


PAK2 


231951 


Hs^84275 


U25975; 
H92785 


hPAK65=SER/THR- 
protein kinase PAK- 
gamma =P21-activated 
kinase 3 


559- 
561 


S100A4 


472180 


Hs.81256 


M80563; 
AA057375; 
AA036758 


SlOO calcium binding 
protein A4=Placental 

calcium binding 

protein=Calvasc\jlin= 

mtsl 

PROTEIN=CAPL 


562- 
564 


CD83 


564503 


Hs.79197 


Z11697; 
AA101749; 
AA101748 


CD83=B-G antigen 
IgV domain 
homolog=B-cell 
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activation 
proteiii=HB15 


565- 
567 


NCOAl 


609445 


Hs.74002 


U59302; 
AA180462; 
AA179970 


SRC-l=steroid 
receptor coactivator 


568- 
570 


RYBP 


649654 


Hs.7910 


AF179286; 
AA216739; 
AA216519 


Dealh effector 
domain-associated 
&ctoi=Binds to 
Caspase 10 DED 
domain=HomoIog of 
mouse RYBP 
repressor protein that 
interacts vdth 
Polyconib conq)lex 
aQdYYl=YAF2 
homolog=DEDAF=Y 
AF2 

homolog=^ffLNewGen 
e3 


571- 
573 


ITGAE 


665279 


Hs.851 


L25851; 
AA195282; 
AA195146 


CD 103 alph^Integrin 
a^ha-E 


574- 
576 


IL7 


701422 


Hs.72927 


J04156; 
AA287945; 
AA288010 


IL-7 


577- 
579 


CD36 


243816 


Hs.75613 


M98399; 
N39161; 
N45238 


0)36 


580- 
582 


PDGFRB 


773439 


Hs.76144 

• 


J03278; 
AA426020; 
AA428115 


Platelet-derived 
growth factor receptor, 
beta 

polypeptide=jused to 
TEL in 

t(5;12)(q33,-pl3) 
chronic 

myelomonocytic 
leukemia 


583- 
584 


IL17R 


842122 


Hs.129751 


U58917; 
AA634809 


IL-17 receptor 


585- 
586 


HGF 


1219612 


Hs.809 


X16323; 
AA687773 


Hqpatocyte growjth . 
&ctor (hqpapoietin A; 
scatter Victor) 


587- 
588 


BAD 


1286754 


Hs.76366 


U66879; 
AA740876 


BAD=bbc6=proapopto 
tic Bcl-2 homolog 


589- 
591 


Z1SIF173 


755176 


Hs.1287 


U09825; 
AA421953; 
AA421952 


acid finger protein 


592- 
593 


2FP161 


285742 


Hs. 156000 


D89859; 
N64141 


ZF5=POZ domain 
2dnc finger protein 


594- 
596 


RGS16 


470132 


Hs.183601 


U70426: 
AA029960; 
AA029959 


A28-RGS14p=G 
protein signaling 
regulator 


597- 
599 


PPPICB 


485729 


Hs.21537 


X80910; 
AA040285; 
AA040284 


PPPlCB=Protein 
pho^hatase 1, 
catalytic sobuni^ beta 
isoform 


600- 
602 


GART 


502761 


H8.82285 


X54199; 

AAl 26256; 
AA126360 


Phospboribosylglycina 
mide 

formyltcansferase. 
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phospho- 

ribosylglycinamide 
synthetase, 

phosphoribosylaniiiioi 
nddazole syn^tase 


603- 
605 


ENPPl 


786041 


Hs.11951 


D12485^ 
AA448639; 
AA448731 


PC-1 = alkaline 

nucleotide 

pyrophosphatase 


606- 
608 


MMP13 


786029 


Hs.2936 


X75308; 
AA448634; 
AA448726 


MMP-13=Nfetrix 
metalloproteinase 
13=CL-3=Collagenase 
3 


609- 
611 


ILK 


292313 


Hs.6196 


U40282; 
N62542; 

N79210 


ILK=integrm-luiked 
kinase 


612- 
614 


SCYA4 


205633 


Hs.75703 


J04130; 
H62864; 
H62985 


MIP-1 

beta=SCA"y2=G- 

26=HC21=pAT 

744=LAG-l=Act- 

2=H400=SIS- 

ganima=H;hemokine 


615- 
617 


KDR 


469345 


Hs.12337 


AF035121; 
AA027012; 
AA026831 


Kinase insert domain 
receptor (a type in 
receptor tyrosine 
kinase) 


618- 
620 


E,18R1 


755054 


H8.159301 


U43672; 
AI821652; 
AI734039 


IL-18 receptor 1=IL- 
IRip 


621- 
623 


PPP2R5A 


41356 


Hs.155079 


L42373; 
R59165; 
R59164 


phosphatase 2A B56- 
alpha(PP2A) 


624- 
625 


PTK2B 


180298 


Hs.20313 


U43522; 
R85257 


protein tyrosine kinase 
PYK2 


626- 
628 


MAP2K3 


45641 


H8.180533 


D87116; 
H08749; 
H08467 


Dual specificity 
mitogen-activated 
protein kinase kinase 3 


629- 
631 


TNFR2R 
P 


124034 


H8.1 17847 


L04270; 
R02558; 
R02676 


Lymphotoxin-Beta 
receptor precursor = 
Tumor necrosis factor 
receptor 2 related 
protein = Tumor 
necrosis factor C 
receptor 


632- 
633 


EST 


739852 


Hs.328687 


AI821550; 
AA477842 


ESTs, Moderately 
similar to 

ALU4_HUMANALU 

subfaiidly SB2 
sequence 

contamination warning 
entry [H.sapiens] 


634- 
635 


EST 


1862171 


Hs.3 10541 


AI053777; 
AI792563 


ESTs 


636 


EST 


1985026 




AI251605 


Unknown 


637 


EST 


2002071 


Hs.203960 


AI249848 


ESTs 


638 


EST 


2047317 




AI3 11297 


Unknown 


639 


EST 


2215752 




AI567814 


Unknown 


640 


EST 


2217459 




An44181 


Unknown 


641 


EST 


2217834 


HS.3284S1 


An44330 


EST, Weakly similar 
toPRPP HUMAN 
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salivary prolioe-rich 
protein II-l 
[Rsapiem] 


642 


EST 


2219300 




AI745684 


Unlaiown 


643 


EST 


2220085 


Hs.337231 


AI798317 


EST 


644 


EST 


2220214 




An98385 


Uakaown 


645 


EST 


2261174 


Hs. 185554 


AI609326 


EST 


646 


EST 


2261169 




AI609331 


Unknown 


647 


EST 


2292810 


Hs.224732 


AI871658 


EST 


648 


EST 


2292831 


Hs.337311 


AI871678 


EST 


649 


EST 


2549950 




AI954130 


Unknown 


650 


EST 


2550130 




AI953438 


Unknown 


651- 
652 


FDFTl 


25725 


Hs.48876 


R36960; 
R11842 


famesyl-diphosphate 
famesyltransferase 1 


653- 
654 


NAGA 


28985 


Hs.75372 


R40255; 
R14305 


N- 

acetylgalactosaminida 
se, alpha- 


655- 
656 


SECRET 


29054 


Hs. 116428 


R40850; 
R14422 


secietagogtn 


657- 
658 


SLC9A1 


30272 


H8.170222 


R42414; 
R14692 


solute carrier fiunily 9 
(sodimn/hydrogen 
exchanger), isofoim 1 
(antiporter, Na+/H+, 
axniloride sensitive) 


659 


TUFM 


34945 


Hs. 12084 


R45183 


Tu translation 
elongation fiictor, 
mitochondrial 


660- 
661 


MNATl 


38471 


Hs.82380 


R49475; 
R3S961 


menage a trois 1 
' (CAK assembly 
&ctor) 


662- 
663 


HARS 


43021 


Hs.77798 


R601S0; 
R60149 


histidyl-fRNA 
synthetase 


664- 
665 


EIF4A1 


46171 


Hs.129673 


H09590; 
H09589 


eukaryotic translation 
initiation factor 4A, 
isofonn 1 


666- 
667 


MPI 


50359 


Hs.75694 


H17096; 
H17714 


mannose phosphate 
isomerase 


668- 
669 


TAGLN2 


45544 


Hs.75725 


H08564; 
H08563 


transgelin2 


670- 
671 


PONl 


128143 


Hs.1898 


R12373; 
R09781 


paraoxonase 1 


672- 
673 


EST 


813444 


Hs.178379 


AA455945; 
AA455554 


ESTs 


674- 
675 


QNB2 


292213 


Hs.91299 


N68166; 
N80625 


guanine nucleotide 
binding protein (G 
protein), beta 
polypeptide 2 


676- 
677 


KIAA036 
5 


811029 


Hs.190452 


AA485383; 
AA485539 


KIAA0365 gene 
product 


678 


NCSTN 


199645 


Hs.4788 


R96527 


nicastrin 


679- 
681 


ARHGEF 
6 


687990 


Hs.79307 


D25304; 
AA236957; 
AA236617 


KIAA0006 


682 


SFRSll 


204755 


Hs. 11482 


H56944 


splicing &ctor, 
arginine/serine-rich 11 


683- 
684 


CUGBPl 


25588 


Hs.81248 


R15111; 
R12181 


CUG triplet repeat, 
RNA-binding protein 
1 


685- 


GABRP 


563598 


Hs.70725 


AA101225; 


gamma-aminobutyric 
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AA102670 


acid(aABA)A 

receptor, pi" 


687- 
688 


BMP6 


768168 


Hs.285671 


AA424833; 
AA426586 


bone morphogenetic 
protein 6 


689- 
690 


A'IT7A 


687820 


Hs.606 


AA236141; 
AA23663S 


ATPase, Cu++ 
transportiiig, alpha 
polypeptide (Menkes 
syndtome) 


691- 
692 


RBBP4 


773599 


Hs.16003 


AA428365; 
AA429422 


retinoblastoma- 
binding protein 4 


693- 
694 


P0UR2A 


740130 


Hs.171880 


AA479052; 
AA477535 


polymerase (RNA) n 
(DNA directedQ 
polypeptide A 
C220kD) 


695- 
696 


SMGl 


785605 


Hs.l 10613 


AA449463; 
AA448998 


PI-3-kinase>telated 

kinase 

SMG-1 


697- 
698 


GTPBPl 


826217 


Hs.283677 


U87964; 
AA521469 


GP-l=putative G- 
protein 


699- 
700 


GS2NA 


767994 


Hs.183105 


AA418821; 
AA418918 


nuclear autoantigen 


701- 
702 


FLJ12442 


32231 


Hs.84753 


R42815; 
R17469 


hypothetical protein 
FLJ12442 


703- 
704 


KIAA021 
8 


49404 


Hs.75863 


H15567; 
H15627 


KIAA0218 gene 
product 


705- 
706 


KIAA014 
4 


245015 


H8.8127 


N52646; 
N72374 


KIAA0144gene 
product 


707- 
708 


FOXOIA 


628955 


Hs.170133 


AA194765; 
AA194764 


forkheadbox OlA 
(rhabdomyosarcoma) 


709 


CSRP2 


75254 


Hs.10526 


T59334 


cysteine and glycine- 
rich protein 2 


710- 
711 


BR£ 


739993 


Hs.80426 


AA479741; 
AA477082 


brain and reproductive 
organ>expressed 
(TNFRSFIA 
modulator) 


712- 

713 


RALY 


825583 


Hs.74111 


AA504617; 
AA504712 


RNA-binding protein 
(autoantigenic) 


714- 
716 


FGFR2 


809464 


Hs.282823 


M87771; 
AA443093; 
AA456160 


FGFR2=Fibroblast 
growth factor receptor 
2 


717- 
718 


EST 


242820 


Hs.290870 


H94050; 
H94131 


ESTs, Weakly similar 
to 138588 reverse 
transcriptase homolog 
[H.sapiens] 


719- 
720 


PEF 


137353 


Hs.241531 


R38031; 
R38117 


PEF protein with a 
long N-terminal 
hydrophobic domain 
(peflin) 


721- 
722 


EST 


265494 


Hs.153445 


N21309; 
N31244 


Human mRNA for 
unknown product, 
partial cds 


723- 
724 


SAST 


739625 


Hs.227489 


AA479623; 
AA477008 


syntrophin associated 
serine/threonine kinase 


725- 
726 


EST 


142499 




R70037; 
R70084 


Unknown 


727- 
728 


PLXNA2 


303035 


Hs.300622 


N91580; 
W19130 


plexin A2 


729- 
730 


EST 


240694 


Hs.167787 


H78135: 
H78134 


ESTs 
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731- 
732 


APMCFl 


198904 


Hs.12152 


K95693; 
R95692 


APMCFl protein 


375 


CALU 


144881 


Hs.7753 




calumenin 


733- 
734 


PPY2 


210873 


Hs.20588 


H67736; 
H66312 


pancreatic polypeptide 
2 


735- 
736 


CRBl 


248485 


Hs.169745 


N59646; 
N78199 


ciumbs (Drosophila) 
homolog 1 


737- 
738 


FU21661 


80095 


Hs.334718 


T63321; 
T63940 


hypothetical protein 
HJ21661 


739- 
740 


RAB3A 


163579 


Hs.27744 


H14231; 
H14230 


RAB3A, member RAS 
oncogene fiimily 


741- 
742 


GCAT 


307094 


Hs.54609 


N93695; 
W21033 


glycine C- 
acetyltransferase (2- 
amino-S-ketobutyrate 
coenzyme A ligase) 


743- 
744 


P14L 


809437 


Ifa. 178576 


AA458464; 
AA442976 


similar to Bos tatmis 
P14 protein 


745- 
746 


KIAAOOO 
8 


357373 


Hs.77695 


W93717; 
W93568 


KIAA0008 gene 
product 


747- 
748 


LOX 


341680 


Hs.102267 


W60414; 
W60413 


lysyl oxidase 


749- 
750 


PISD 


343609 


Hs.8128 


W69460; 
W69544 


phoq)hatidylserine 
decarboxylase 


751- 
752 


EST 


341834 


Hs.27278 


W60647; 
W60905 


ESTs, Weakly similar 
to A Chain A, 
CyclophilinA 
[H.sapiensl 


753- 
754 


EST 


809490 


Hs.3737 


AA443117; 
AA456181 


ESTs 


755- 
756 


TMEPM 


809824 


Hs.83883 


AA455519; 
AA464401 


transmembrane, 
prostate androgen 

induced RNA 


757- 
758 


ZNF211 


346947 


Hs.15110 


W79396; 
W79316 


zinc finger protein 211 


759 


LOC5160 
5 


810343 


Hs. 128791 


AA464166 


CXjI-09 protein 


760- 
761 


MAPREl 


428223 


Hs.234279 


AA001749; 
AA001819 


microtubule- 
associated proteio, 
RP/EBiamily, 
member 1 


762 


FU10701 


430068 


Hs. 146589 


AA009830 


hypothetical protein 
FLJ10701 


763- 
764 


DKFZP56 
4C186 


366353 


Hs. 134200 


AA026278; 
AA026277 


DKFZP564C186 
protein 


765 


EST 


810205 


Hs.264606 


AA464518 


ESTs 


766- 
767 


F23149_l 


428507 


Hs.152894 


AA004525; 
AA004607 


hypothetical protein 
F23149 1 


768- 
769 


FLJ21940 


810795 


Hs.104916 


AA458876; 
AA459066 


hypothetical protein 
FLJ21940 


770- 

771 


FU22059 


292223 


Hs.13323 


N62464; 
N79183 


hypo&etical protein 
FU22059 


772- 
773 


EST 


241861 


Hs.269020 


H93115; 
H93243 


ESTs 


398 


RGSl 


361323 


Hs.75256 




regulator of G-protein 
signalling 1 


774 


PDE6A 


361840 


Hs.182240 


W92514 


pho^hodiesterase 6A, 
cGMP-specific, rod, 
alpha 


775- 


ILIB 


491763 


Hs.126256 


AA150507: 


interleuldn 1, beta 
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776 








AA156711 




777 


SF3B4 


432564 


Hs.25797 


AA699361 


splicing factor 3b, 
subumt 4, 49kD 


/ fO 


POT 


7771597 
Z/ /OZ/ 


xiS.J4o4Z/ 


N45979 


Human SH3 domain- 
containing protein 
SiI3P18 mRNA, 
complete cds 


77Q 








AAooylio 


transcription factor 4 


780- 
7R1 


RAB2L 


741891 


H8.170160 


AA401972; 
AA402117 


RAB2, member RAS 
oncogene family-like 


782- 


GOLGAl 


34102 


Hs.172647 


R44140; 

K23o87 


golgi autoantigen, 
golgin subfamily a, 1 


784 


TNRC12 


770000 


Hs.306094 


AA427519 


trinucleotide repeat 
containing 12 


785 


CSNKIE 


854138 


Hs.79658 


AA669272 


casein kinase 1, 
epsilon 


786- 
787 


AFP 


74537 


Hs.155421 


T59043; 
T59118 


alpha-fetoprotein 


788 
/oo- 

789 






iis.155185 


AA156560; 
AA157732 


cytosolic ovarian 
carcinoma antigen 1 


7Qn 
791 


AT>X?Y 

/VJrJBrA 


I'njyU 1 


Hs.73722 


AA478273; 
AA478331 


APEX nuclease 
(multifunctional DNA 
repair enzyme) 


792- 
793 


RBBP2 


841655 


Hs.76272 


AA487492; 
AA487706 


retinoblastoma- 
binding protein 2 


794- 
796 


GROl 


323238 




M36820; 
W42723; 
W42812 


Human cytokine 
(GRO-beta)mRNA; 
GR02=GR0beta = 
MIP2 alpha = 
macrophage 
inflammatory protein- 
2 alpha - chemokine 


797- 
799 


WNT2 


149373 




X07876; 
H04382; 
H04408 


wingless-type MMTV 
integration site &mily 
meinber2 


800 


PTGS2 


845477 


Hs.196384 


AA644211 


cyclooxygenase-2; 
prostaglandin 
endoperoxide 
synthase-2 


oUl- 


JrKNl' 




Hs.74621 


AA029059; 
AA029163 


prion protein 


803- 
804 


WNT5B 


323636 


H8.306051 


W44518; 
W44517 


Homo sapiens mRNA 
forWNTSB, complete 
cds 


805 


CD72 


1241854 




AA714696 




806 




897774 




AA598510 


adenine 

phosphoribosyltransfer 
ase 


Rn7 
808 








AA460168 
AA460768 


protein phosphatase 1, 
regulatory (inhibitor) 
subumt ISA 


sno 
ouy- 

810 




oz5zl4 




AA5041 13 
AA504371 


M-phase 

phosphoprotein 10 


811- 
812 




154720 




R55220 


ARDl homolog, N- 
aceiyiuansrerase ^o. 
cerevisiae) 


813- 
814 




204214 




H59204 
H59203 


CDC6 cell division 
cycle 6 homolog (S. 
cerevisiae) 
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815- 
816 




815294 




AA481547 
AA481613 


protein tyrosine 
phospliatase, leceptor 
type, C-associated 
protein 


817 




825265 




AA504204 


polymerase (DKA 
directed), delta 3 


818 




2549467 




AI952542 


unknown EST 


819 




1056107 




AA628360 


putative cyclin Gl 
interacting protein 


820- 
821 




809515 




AA454565 
AA456458 


pLK==liomologue of 
Drosophila polo 
serine/threonine kinase 


822 


CMKBR6 






U45984 


CCR6=STRL22=chem 
okine receptor for 
MIP-3 

alpha/LARC/Exodus 
on activated B cells 
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Table 4. Markers that were Under-expressed in Ovarian Cancer in a 
Comparison of Ovarian Epitiielial Cancer to Normal Postmenopausal Ovarian 
Tissue 



SEQ.m. 
NO. 


IMAGE ID 


Nucleic 
Acid 


Description 


Average 

log 
normal 


Average 

log 
cancer 


Cancer 

to 
normal 


202 


878596 


ITM2A 


integral membrane protein 2A 


1.145 


-2.036 


0.110 


203-204 


425S8 


QAIM 


glycine amidinotransferase (L- 

arginine:glycine 

amidinotransferase) 


4.137 


0.945 


0.109 


205-207 


81417 


RNASE4 


ribonuclease L (2',5'- 
oligoisoadenylate syntiietase- 
dependent) 


2.057 


-0.744 


0.144 


208-210 


471642 


LAMA2 


laminin alpha 2 (merosin, 
congenital muscular dystrophy) 


2.806 


0.361 


0.184 


211 


448386 


PBX3 


pre-B-cell leukemia 

transcription factor 3 


2.354 


-0243 


0.165 


212 


1472538 


PLA2G6 


phospholipase A2, groiq> VI 
(cytosolic, calcium-independent) 


2.604 


0.099 


0.176 


213 


814636 


SMARCA2 


SWI/SNF related, matrix 
associated, actin dependent 
regulator of chromatin, 
subfamily a, meniber 2 


3.055 


0.231 


0.141 


214-215 


488956 


CUGBP2 


CUG triplet repeat, KNA- 
binding protein 2 


2.960 


-0.043 


0.125 


216-218 


209655 


TGFBR3 


TGF beta receptor type m 


1.956 


0.057 


0.268 


219 


859858 


STAR 


steroidogeoic acute legnlatory 
protein 


1.685 


0.026 


0.317 


220 


1636447 


OIGll 


guanine nucleotide blading 
protein 1 1 


1.683 


-0.953 


0.161 


221 


491565 


CITED2 


Cbp/p300-interacting 
transactivator, with Glu/Asp- 
rich carboxy-terminal domain, 2 


1.576 


-0.497 


0.238 


222 


744647 


CTNNALl 


catenin (cadherin-assodated 
protein), alpha-like 1 


1.498 


-0.761 


0.209 


223 


743773 


ABCA8 


ATP-binding cassette, sub- 
family A (ABCl), meniber 8 


2.317 


0.060 


0.209 


224-226 


188232 


KLF4 • 


GKLF=E2ar=KLF4=gut- 
enriched Kruppel-like zinc 
filler proteiff^jqnessed in 
vascular endothelial cells 


1.644 


-0.741 


0.191 


227-228 


471725 


ITPRl 


inositol 1,4,5-triphosphate 
receptor, type 1 


1.600 


-0.616 


0.215 


229-230 


487793 


MAP 


v-maf musculoaponeurotic 
fibrosarcoma (avian) oncogene 
homolog 


0.765 


-2.032 


0.144 


231-232 


768370 


FOXCl 


forkheadboxCl 


2.270 


-0.021 


0.204 


233 


461351 


TCF21 


transcription &ctor 21 


1.733 


0.193 


0.344 


234-236 


248295 


CCM 


CyclinI 


2.460 


-0.030 


0.178 


237-238 


209367 


DCN 


decorin 


3.762 


0.582 


0.110 


239-240 


789049 


CBF2 


CCAAT-box-binding 
transcription fector 


2.140 


0.019 


0.230 
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241-242 


68049 




. Homo sapiens, clone 
IMAGE:4I83312, mRNA, 
partial cds 


2.074 


0.243 


0.281 


122 


81417 


BKASE4 


ribonuclease L (2',5'- 
oligoisoadenylate synthetase- 
dependent) 


1.696 


. -0.117 


0.285 


243 


2094012 


SLC4A1AP 


solute carrier family 4 (anion 
exchanger), member 1, adapter 
protein 


1.869 


0.195 


0.313 


244-/45 


377731 


GSTM5 


glutathione S-transferase M5 


1.558 


0.242 


0.402 


246 


460470 


C4BPB 


con^lement component 4- 
binding protein, beta 


0.750 


-0.851 


0.330 


247-248 


73609 


HS3ST1 


heparan sulfate (glucosamine) 3- 
O-sulfotransferase 1 


2.017 


0.328 


0.310 


249 


2413955 


CDKNIC 


cyclin-dependent kinase 
inhibitor IC (p57. Kip2) 


2.739 


0.465 


0.207 


250 


897823 


HNRPDL 


heterogeneous nuclear 
ribonucleoprotein D-like 


1.703 


-0.229 


0.262 


251 


1558799 


CIRBP 


cold inducible RNA-binding 
protein 


1.817 


-0.183 


0.250 


252 


2321596 


RGS2 


regulator of G-protein signalling 
2, 24kD 


1.607 


-0.290 


0.269 


253-254 


786607 


TCEALl 


transcription elongation &ctor A 
(Sn)-like 1 


1.737 


0.083 


0.318 


255-256 


377461 


CAVl 


caveolin 1, caveolae protein, 
22kD 


0.146 


-2.506 


0.159 


257 


855624 


ALDHlAl 


aldehyde dehydrogenase 1 
family, member A 1 


2.030 


-0.068 


0.233 


258-259 


343443 


KBPMS 


RNA-binding protein gene with 
multiple splicing 


1.727 


-0.240 


0.256 


260-261 


62263 


ADAMTSl 


a disintegrin-like and 

metalloprotease (reprolysin 
type) with thrombospondin type 
1 motif, 1 


1.589 


0.029 


0.339 


2o2-2o3 


51408 


DSCRILI 


Down syndrome critical region 
gene 1-like 1 


2.054 


0.260 


0.288 


264 


436121 


DLKl 


delta-like homblog (Drosophila) 


0.307 


-1.944 


0.210 




491113 


CDHll 


cadherin 1 1 , type 2, OB- 
cadherin (osteoblast) 


2.308 


0.502 


0.286 


139 - 


840776 


SGK 


sgk=putative serine/threonine 
protein kinase transcriptionally 
modified during anisotonic and 
isoionic aiieraaons oi ceii 


1.444 


-0.201 


0.320 • 


267 


450060 


HFLl 


H factor (complement)-like 1 


2.490 


0.664 


0.282 




38347 




rnena oi viAi az 


i.yuj 


U.4oi 


0.368 


270 


898122 


\^ 1 


complement con^onent 7 


Z.043 


U.ooU 


0.253 


271-272 


784109 


SGCE 


sarcoglycan, epsilon 


1.573 


-0.213 


0.290 


273 


898305 


NBLl 


neuroblastoma, suppression of 
tumorigenicity 1 


1.910 


0.153 


0.296 


274-275 


173385 


HBB 


hemoglobin, beta 


0.165 


-2.317 


0.179 


276-278 


840683 


CARP 


Cytokine inducible nuclear 
protein 


1.728 


0.018 


0.306 


970-9 Rft 






microphthalmia-associated 
transcription factor 


0.242 


-1.580 


0.283 


281-282 


898062 


CDC20 


pSSCDC 


1.679 


-0.043 


0.303 


283-285 


148028 


EPS8 


epidermal growth factor receptor 
dnase substrate (£ps8) 


1.595 


-0.344 


0.261 
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286-287 


345680 


ARHI 


tas homolog graie fiunily, 
member I 


1.595 


0.212 


0383 


288 


857002 


B4-2 


proline-rich protein with nuclear 
targeting signal 


1.523 


0.285 


0.424 


289-291 


186132 


SELE 


ELAMl=endodiBlial leukocyte 
adhesion molecule I 


1.480 


-0.393 


0.273 


292-293 


133273 


PMP22 


peripheral myelin protein 22 


1.767 


0.017 


0.297 


294 


340657 


EBAF 


endometrial bleeding associated 
factor (left-light determination, 
factor A; transforming growth 
factor beta super&mily) 


1.422 


-0.076 


0.354 


295-296 


609663 


PRKAR2B 


protein kinase, cAMP- 
dependent, regulatory, type n, 
beta 


1.357 


-0.461 


0.284 


297 


1573311 


NPKBIE 


nuclear factor of kappa light 
polypeptide gene enhancer in B- 
cells iidiibitor, epsilon 


1.242 


-1.102 


0.197 


298-299 


269806 


KIT 


v-kit Hardy-Zuckerman 4 feline 
sarcoma viral oncogene 
homolog 


2.324 


0.315 


0.248 


158 


309864 


JUNE 


jun B proto-oncogene 


3.465 


1.117 


0.196 


300 


1573108 


BCKDK 


branched chain alpha-ketoacid 
dehydrogenase kinase 


1.306 


-1.057 


0.194 


301-303 


298268 


BTGl 


BTGl=B-cell translocation gene 
l=anti-proUferative 


1.409 


•0.239 


0319 


304-305 


784772 


AKAP12 


A kinase (PRKA) anchor protein 
(gravin) 12 


1.568 


-0.413 


0.253 


306-307 


898221 


NR4A2 


NOT=Immediate early response 
proteiiF=Nuirl 
homologue=Nurr77 orphan 
steroid receptor family member 


2.092 


0.341 


0.297 


308-309 


126531 


HBB 


hemoglobin, beta 


0.190 


-2.034 


0.214 


310-311 


768489 


ARHGAF6 


Rho GTPase activating protein 6 


1.263 


-0.109 


0.386 


312 


1568391 


PLS3 


plastin 3 (T isofonn) 


1.241 


-1.045 


0.205 


313-314 


300015 


FNTA 


famesyltiansferase, CAAX box, 
alpha 


1.354 


0.091 


0.417 


315-316 


770670 


TNFAIP3 


tumor necrosis &ctor, alpha- 
induced protein 3 


2.088 


0.445 


0.320 


317-318 


840944 


EGRl 


early growth response 1 


3.245 


0.765 


0.179 


319 


795213 


RNAC 


RNA cyclase homolog 


1.812 


0.043 


0.293 


320 


813584 


PA26 


p53 regulated PA26 nuclear 
protein 


1329 


0.084 


0.422 


321 


1573778 


Cllorfl3 


chromosome 11 open reading 
frame 13 


1.241 


-1.048 


0.205 


322 


2169465 


INGIL 


inhibitor of growth family, 
member 1-like 


1.267 


0.031 


0.425 


323 


2577249 


RPL9 


ribosomal protein L9 


1.756 


-0.313 


0.238 


324-325 


813711 


ADH5 


alcohol dehydrogenase 5 (class 
TU), chi polypeptide 


1.211 


-0.616 


0.282 


326-327 


298122 


FZD7 


fiizzled (Drosophila) homolog 7 


2.554 


0.747 


0.286 


328-329 


366100 


MATN2 


tnatrilin2 


2.205 


0.536 


0315 


330-331 


71863 


SLC11A3 


solute carrier family 1 1 (proton- 
coupled divalent metal ion 
transporters), member 3 


2.466 


0.808 


0317 


332-333 


767641 




ESTs 


0.965 


-0.153 


0.461 


334-335 


292463 


ERCC5 


excision repair cross- 
conq[)lemBnting rodent repair 
deficiency, complementation 


1.695 


-0.098 


0.289 
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Table 5. Markers that were Over-expressed in Ovarian Cancer in a Comparison 
of Ovarian Epitfielial Cancer to Normal Postmenopausal Ovarian Tissue 



SEQ.ID. 
NO. 


IMAGE 

m 


Nucleic 
Add 


Description 


Average 

log 
normal 


. Average 
log 
cancer 


Cancer 

to 
normal 


18-19 


82195 


SERPINF2 


Branched chaia keto acid 
dehydrogenase El, beta 
polypeptide (m^le syrup urine 
(Usease) 




1.45 




30-31 


295939 


FLJ22174 


hypothetical protein FLJ22174 




1.38 




50-51 


755599 


nTTMl 


Interferon induced 
transmembrane protein 1 (9-27) 




1.74 




55-57 


624655 


IFITM2 


Interferon-induced protein 1-8D 




1.53 




60 


786675 


HE4 


Epididymis-specific, whey-acidic 
protein type, four-disulfide core; 
putative ovarian carcinoma 
madcer 




2.41 




68-69 


782513 


G1P3 


Interferon, alpha-inducible 
protein (clone IFI-6-16) 




1.64 




74-76 


182288 


DDRl 


Receptor protein-tyrosine kinase 
EDDRl 




1.43 




85-86 


811139 


HLA- 
DRB5 


Major histocompatibility 
complex, class II, DR beta 5 




1.91 




101-103 


417711 


HLA- 
DRBl 


Major histocompatibility 
complex, class 11, DR beta 1 




1.94 




89-91 


725751 


CD74 


Invariant chain=la-associated 
invariant gamma-chain 




2.69 




92-93 


840681 


C3)74 


Invariant chain=la-a8sociated 
invariant gamma-chain 




2.58 




94-96 


117411 


HLA-DRA 


MHC Class n=DR a^ha 




1.62 




97-99 


207715 


HLA-DPA 


MHC Qass n=DP alpha 




1.85 




122-123 


361323 


RGSl 


regulator of G-protein signaling 1 




1.73 




133-135 


755279 


FOS 


c-fos 




1.76. 




149-151 


279388 


SORLl 


Mosaic protein LRl l=hybrid 
receptor gp250 precursor 




1.56 




164-166 


813256 


ABCBl 


MDRl^Multidrug resistance 
protein l=P-glycoprotein 




1.64 




167-168 


23804 


ZFP36 


Zinc finger protein homologous 
to 2<Q>-36 in mouse 




1.74 




169-171 


135880 


ZFP36 


TTP=tristetraproline=GOS24=zin 
c finger transcriptional regulator 




2.00 




174-175 


41650 


HGF 


hepatocyte growdi factor 
(hepapoietin A; scatter &ctor) 




1.54 




176-178 


840776 


SGK 


sgkFputative serine/threonine 
protein kinase transcriptionally 
noodified during anisotonic and 
isotonic alteration 




2.02 




179-180 


814508 


PPP1R7 


Protein phosphatase 1, regulatory 

subunit7 




1.78 




181-182 


204335 


0024 


CD24 antigen (small cell lung 
carcinoma cluster 4 antigen) 




2.75 
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190-192 


485770 


Bm 


Tis 1 ld=ERF-2=growth fector 
early response gene 




1.62 




199-201 


144675 


TLR3 


TLR3= Toll-like receptor 3 




1.98 




340-341 


378813 


SLPI 


secretory leukocyte protease 
inhibitor (antileukoproteinase) 


-0.379 


2.294 


6.377 


342 


378461 


SPPl 


secreted phosphoprotein 1 
(osteopontin, bone sialoprotein I, 
early T-lyn^bocyte activation 1) 


-2.657 


-0.088 


5.938 


343-344 


741977 


BF 


B-factor, properdin 


-0.362 


1.953 


4.974 


345-347 


810899 


CKSl 


ckshsl^'homolog of 

Cksl=p34Cdc28/Cdc2-associated 

protein 


-1.484 


0.637 


4.351 


348-349 


470393 


MMP7 


matrix metalloproteinase 7 
(matrilysin, uterine) 


0.673 


2.535 


3.635 


350-351 


742101 


PAX8 


paired box gene 8 


-0.566 


1.196 


3.391 


352-353 


814378 


SPINT2 


serine protease inhibitor, Kunitz 
type, 2 


-0.306 


1.432 


3.336 


354 


451907 


ZWINT 


ZWIO interactor 


-2.461 


-0.856 


3.043 


355 


2544675 


DGKH 


diacylglycerol kinase, eta 


-0.036 


1.498 


2.896 


356 


782811 


HMGIY 


high-mobility group (nonhistone 
chromosonoal) protein isoforms I 
andY 


-2.272 


-0.760 


2.851 


357-359 


504763 


SDC4 


Syndecan-4 = anq)higlycan= 
ryudocan core protein 


-0.871 


0.575 


2.725 


360 


1161155 


CDKN2A 


cyclin-dependent kinase inhibitor 
2A (melanoma, pl6, inhibits 
CDK4) 


-0.839 


0.593 


2.699 


361-362 


810873 


SCNNIA 


sodium channel, nonvoltage- 

gated 1 alpha 


0.127 


1.534 


2.652 


363 


43550 


LDHA 


lactate dehydrogenase A 


-3.496 


-2.152 


2.538 


364-365 


131839 


FOLRl 


folate recqptor 1 (adult) 


-0.867 


0.467 


2J22 


366-367 


855749 


TPIl 


Triosephosphate isomerase 1 


-2.272 


-1.008 


2.400 


368 


2514426 


KLK8 


kallikrein 8 (neuropsin/ovasin) 


-0.491 


0.742 


2.352 


200 


79629 


CXCR4 


CXC chemokine receptor 4== 
fusin=neuropeptide Y 
receptor=L3 


-0.588 


0.618 


2.307 


369-370 


825606 


KNSLl 


kinesin-like 1 


-1.797 


-0.602 


2.290 


371-372 


488964 


H2AF0 


H2A histone family, member O 


-1.329 


-0.144 . 


2.274 


373-374 


153411 


HLA-DRA 


major histoconq)atibility 
complex, class 11, DR alpha 


1.967 


3.150 


2.270 


O/D 






cysteine-rich protein 1 (intestinal) 


0.086 


1.246 


2.234 


D /O 


y ju /uu 


rr 


pyrophosphatase (inorganic) 


-1.029 


0.118 


2.214 






ESTs 


Unknown 


0.214 


1.360 


2.212 








glucose transporter (HepG2) 


-1.190 


-0.050 


2.204 


382 


897770 


ESTs 


Unknown 


-0.173 


0.943 


2.167 


383-385 


813673 


HDGF 


hepatoma-derived growth factor 


-0.786 


0.329 


2.166 


386 


882522 


ASS 


argininosuccinate synthetase 


-0.424 


0.676 


2.143 


387-388 


770388 


CLDN4 


claudin 4 


0.065 


1.159 


2.135 


389 


897956 


FRAME 


preferentially expressed antigen 
in melanoma 


-2.071 


-0.977 


2.134 


390-391 


897788 


PTPRF 


LAR = LCA-homologue 


-0.175 


0.900 


2.108 


392-393 


741139 


EYA2 


eyes absent (Drosophila) 
homolog 2 


-0.133 


0.939 


2.102 


394-396 


138917 


MYCLl 


L-myc 


-0.042 


1.026 


2.096 


397-399 


840691 


STATl 


STAT1=IFN alpha/beta- 
responsive transcription factor 


-0.023 


1.044 


2.095 
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ISGF3 beta subumts (p91^84) 








400-401 


564492 


MTCH2 


mitochondrial earner homolog 2 


-1.566 


-0.512 


2.076 


402 


435597 


HTR3A 


S-hydbroxytiyptanmie (serotonin) 
receptor 3A 


-0.392 


0.656 


2.067 


403-404 


68950 


CCNEl 


cyclin El 


-0.470 


0.577 


2.066 


405 


739155 


CDH6 


cadheiin 6, type 2, K-cadherin 
(fetal kidney) 


-0.286 


0.748 


2.048 


406-408 


531028 


PRKAGl 


5'-AMP-activated protein kinase, 
gamma-1 subunit 


0.148 


1.181 


2.046 


409 


2403485 


DEFBl 


defensin, beta 1 


0.335 


1357 


2.031 


410-411 


626S02 


ARPCIB 


actin related protein 2/3 conq>lex, 
subunit lA (41 kD) 


-0.808 


0.213 


2.030 


412-414 


71622 


PRKCI 


PKC iota=Protein kinase C, iota 


-0.202 


0.802 


2.006 


415 


1610448 


GAPD 


glyceraldehyde-S-phosphate 
dehydrogenase 


-1.484 


-0.480 


2.005 


416-417 


85497 


C2 


conqjlement conq)onent 2 


-0.413 


0.589 


2,002 


418-419 


843075 


H2AFY 


H2A histone family, member Y 


-0.829 


0.164 


1.990 


420-421 


840567 


TM4SF1 


transmembrane 4 supeifamily 
member 1 


-1.261 


-0.270 


1.987 


422-423 


50117 


GAPD 


glyceraldehyde-3-phosphate 
dehydrogenase 


-2.692 


-1.706 


1.981 


424-426 


809910 


IFITM3 


Interferon-inducible protein 1-8U 


-0.099 


0.887 


1.981 


427-428 


248261 


GLDC 


glycine dehydrogenase 
(decarboxylating; glycine 
decarboxylase, glycine cleavage 
system protein P) 


-0.757 


0.221 


1.970 


429-430 


144881 


CALU 


calranenin 


-1.579 


-0.620 


1.943 


431-432 


208764 


HBA2 


hemoglobin, alpha 2 


-0.114 


0.837 


1.934 


433 


810612 


SlOOAll 


SlOO calcium-binding protein 
Al 1 (calgizzarin) 


-0.670 


0.279 


1.931 


434-436 


897567 


LDHA 


Lactate dehydrogenase A 


-2.982 


-2.038 


1.925 


437 


769921 


UBE2C 


ubiquitin-conjugating enzyme 
E2C 


-1.429 


-0.487 


1.922 


438-440 


304908 


E2F3 


E2F-3=pRB-binding transcription 
&ctar=KIAA0075 


-0.526 


0.416 


1.921 


441-442 


251019 


CDHl 


E-cadheiin 


-0.248 


0.682 


1.905 


443-444 


210405 


PSME2 


proteasome (prosome, 
macropain) activator subunit 2 
(PA28beta) 


-0.715 


0.213 


1.902 


445-447 


344430 


BMP7 


OP-l<=osteogemc protein in the 
TGF-beta&mily 


-0.075 


0.852 


1.901 


448 


825470 


TOP2A 


T0P2A 






2 
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Table 6. Markers that were Differentially E]q>ressed Between BRCAl-Liaked 
and Sporadic Tumors in a Comparison to reference Immortalized Ovarian 
Epitlielial Cells. 



S£Q 

m 

NO. 


Nudeic 
Add 


Description 


Geometric 
mean of 
ratios m 
BRCAl 


Geometric 
mean of 
ranos in 
sporadic 


roia 
difference 
in 

geometric 
means 


one 


CD72 


B-cell difrerentiation antigen 
CD72 (human); 


1 AO 

I Ay 


in 
1.17 


A TA 


C A A 

544 


OT /^OC All 

SLC25A1 1 


solute carrier family 25 
(nntochondrial carrier; 
oxoglutarate carrier), member H 


1.27 


1 AO 


i>.o4 


CAS 

545- 
547 




lipocalin 2 (oncogene 24p3) 


1 oo 




A 

u./o 


538- 
540 


Dli' I 'IJ t'i'M 1 


mterieron-*gamma ihr oor 
5111; Interferon gamma 




101.1 f\A 

1.31; 1.U4 


u.o/; U.03 


543 


SIAHBPl 


fuse-binding protein-interacting 
repressor 


1.86 


1.21 


0.65 


533 


UBEl 


ubiquitin-activating enzyme El 


1.54 


0.94 


0.61 


524- 
526 


WAS 


Wiskott-Aldrich syndrome 
protein 


1.13 


0.79 


0.7 


541- 
542 


IDH2 


isocitrate dehydrogenase 2 
(NADP+), mitochondrial 


1.69 


1.02 


0.6 


527- 
528 


PCTKl 


PCTAIRE 1 serine/threonine 
protein kinase 


1.33 


1.12 


0.84 



Table 7. Markers that were Differentially Expressed Between AieC42-Liiiked 
and Sporadic Tumors in a Comparison to reference Immortalized Ovarian 
Epitlielial Cells. 



SEQ 
ID 
NO. 


Nucleic 
Acid 


Description 


Geometric 
mean of 
ratios in 
BRGAl- 
linlced 
tumors 


Geometric 
mean of 
ratios in 

sporadic 
tumors 


Fold 
difference 
in 

geometric 
means 


279 


LOC51760 


B/K protein 


1.32 


1.1 


0.83 


280 


LRPAPl 


low density lipoprotein-related 
protein-associated protein 1 
(alpha-2-macroglobulin 
receptor-associated protein 1) 


1.45 


1.13 


0.78 
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Table 8. Markers that were Differentially Expressed Between Combined BRCA- 
Linked Group and Sporadic Tumors in a Comparison to reference Immortalized 

Ovarian Epithelial Cells. 



SEQ 
ID 
NO. 


Nucleic 
Acid 


Descriptioii 


Geometric 
mean ox 
ratios in 
BRCAl- 
Unked 
tumors 


Geometric 
mean of 
ratios in 
SDoradic 

tumors 


Fold 
difference 
in 

means 


281 


PSTPIPl 


interferon-gamzna lEF SSP 
511 l==Interferon gamma 
upregulated protein 


1.73; 1.41 


1.31; 1.04 


0.76; 0.74 


282 


roH2 


Isocitrate dehydrogenase 2 
(NADP+), mitochondrial 


1.66 


1.02 


0.61 


274 


PCTKl 


PCTAIRE 1 serine/tbreonine 
protein kinase 


1.29 


1.12 


0.86 



Table 9. Markers that were Differentially Expressed between BRCAUmi^ and 
^i;C42-like tumors in a Comparison to reference Immortalized Ovarian 
Epithelial Cells. 



SEQ 
ID 
NO: 


Gene 


Desolption 


Geometric 
mean of 
ratios in 
BRCAl 


Geometric 
mean of 
ratios in 
BRCAl 


Fold 

Difference 
in 

Geometric 
Means 


122- 
123 


RGSl 


regulator of G-protein 
signalling 1 


1.79 


4.75 


2.65 


122- 
123 


RGSl 


BL34=RGSl=regulator of G- 
protein^ignaling which inhibits 
SDF-1 directed B cell 
nngration 


2.09 


5.05 


2.41 


594- 
596 


RGS16 


A28-RGS14p=G protein 
signaling regulator 


1.22 


2.32 


1.9 


612- 
614 


SCYA4 


MIP-1 beta=SCAY2=G- 
26=HC21=pAT 744=LAG- 
1-Act-2=H400=SIS- 
ganmia=H;hemo]dne 


1.29 


2.23 


1.73 


612- 
614 


SCYA4 


MIP-1 beta=SCAY2=G- 
26=HC21=pAT 744=LAG- 
l=Act-2=H400=SIS- 
gamm^chemoldne 


1.09 


1.79 


1.64 


515- 
517 


SFRP4 


frizzled related protein fipHE 


1.13 


1.85 


1.63 


594- 
596 


RGS16 


A28-RGS14p=G protein 
signaling regulator 


1.33 


2.11 


1.58 


790- 
791 


APEX 


APEX nuclease 
(multifunctional DNA repair 
enzyme) 


0.66 


1.04 


1.58 


682 


SFRSll 


splicing factor, arginine/serine- 
richll 


0.69 


1.09 


1.57 


507- 
509 


PTEN 


MMACl=PTBN=^runior 

siqjpressor gene at 10q23.3 tiiat 
is Mutated in Multiple 
Advanced 

Cancers=Phosphatase and 


1.03 


1.56 


1.51 
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tensin homolog 








774 


PDE6A 


phosphodiesterase 6A, cGMP- 
specific, rod, alpha 


1.23 


1.85 


1.51 


562- 
564 


CD83 


aD83=B-G antigen IgV 
domain homolog='B-cell 
activation protein=HB15 


1.46 


2.19 


1.5 


592- 
593 


ZFP161 


2T5=POZ domain zinc finger 

protein 


1.03 


1.49 


1.45 






ESTs 


1.17 


1.69 


1.44 


707- 
708 


FOXOIA 


forkhead box 01 A 
(rhabdomyosarcoma) 


1.38 


1.93 


1.4 


762 


FLJ10701 


hypothetical protein FLJl 070 1 


1 


1.4 


1.39 


577- 
579 


CD36 


CD36 


1.32 


1.82 


1.38 


797- 
799 


WNT2 


Wnt-2 


0.81 


1.12 


1.38 






Unknown 


0.82 


1.13 


1.38 


779 


TCF4 


transcription factor 4 


1.19 


1.62 


1.36 


615- 
617 


KDR 


Kinase insert domain receptor 
(a type HI receptor tyrosine 
kinase) 


0.8 


1.08 


1.35 






ESTs 


0.81 


1.09 


1.35 


534- 
535 


LOC5i760 


B/K protein 


0.98 


1J2 


135 


797- 
799 


WNT2 


Wnt-2 


0.99 


1.33 


1.34 


683- 
684 


CUGBPl 


CUG triplet repeat, RNA- 
binding protein 1 


0.73 


0.98 


133 


709 


CSRP2 


cysteine and glycine-iich 

protein 2 


0.9S 


1.31 


1.33 






ESTs, Moderately similar to 
ALU4 HUMAN ALU 
SUBFAMILY SB2 
SEQUENCE 
CONTAMINATION 
WARNING ENTRY 
[H.sapiens] 


0.81 


1.06 


132 


606- 
608 


MMP13 


MMP-13=Matrix 
metalloproteinase 13=CL- 
3=Collagenase 3 


0.99 


1.3 


1.32 


580- 
582 


PDGFRB 


Platelet-derived growth &ctor 
receptor, beta 

polypeptide=fused to TEL in 
t(5;12)(q33;pl3) chronic 
myelomonocytic leukemia 


1.41 


1.85 


131 


603- 
605 


ENPPl 


PC-1 = alkaline nucleotide 
pyrophosphatase 


1.01 


1.32 


131 






FGFR2=Fibroblast growth 
&ctor receptor 2 


0.8 


1.05 


131 


695- 
696 


SMGl 


PI-3-kinase-related kinase 
SMG-1 


1.07 


1.4 


131 


521- 
523 


TALI 


scl=tal-l=T-cell acute 
lymphocytic leukemia 1 


1.14 


1.49 


131 


727- 
728 


PLXNA2 


plexin A2 


1.32 


1.71 


13 


759 


LOC51605 


CGI-09 protein 


0.8 


1.04 


13 


784 


TNRC12 


trinucleotidB repeat containing 
12 


1.02 


1.33 


13 
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EST 


0.86 


1.12 


1.3 


797- 
799 


WNT2 


wingless-type MMTV 
integration site family member 
2 


0.98 


1^7 


1.29 


693- 
694 


P0LR2A 


polymerase (RNA) n (DNA 
directed) polypeptide A 
(220kD) 


0.79 


1.01 


1.29 


737- 
738 


FU21661 


hypothetical protein FLJ2 1661 


0.61 


0.79 


1.28 


780- 
781 


RAB2L 


RAB2, member RAS oncogene 
family-like 


1.05 


1.35 


1.28 


577- 
579 


CD36 


CD36 


1.26 


1.61 


1.28 


568- 
570 


RYBP 


Death effector domain- 
associated &ctoF=Binds to 
Caspase 10 DED 
domain=Homolog of mouse 
RYBP repressor protein that 
interacts withPolycoiob 
con5)lex and YY1=YAF2 
homolog=DEDAF=YAF2 
homolog=MLNewGene3 


0.89 


1.14 


1.28 


571- 
573 


ITGAE 


CD103 alpha=Integrin alpha-E 


1.09 


1.38 


1.27 






Human 8113 domain-containing 
protein SH3P18 mRNA, 
c(Hiq)lete cds 


1.23 


1.56 


1.27 


755- 

756 


TMEPAI 


transmembrane, prostate 
androgen induced RNA 


0.99 


1.25 


1.26 


565- 
567 


NCOAl 


SRC-l=steroid receptor 
coactivator 


1.04 


1.3 


1.25 


785 


CSNKIB 


casein kinase 1, epsilon 


0.74 


0.92 


1.25 


768- 
769 


FU21940 


hypothetical protein FLJ2 1 940 


0.91 


1.14 


1.25 


723- 
724 


SAST 


syntrophin associated 
serine/threonine kinase 


1.05 


1.32 


1.25 






ESTs 


0.89 


1.11 


1.25 


782- 
783 


GOLGAl 


golgi autoantigen, golgin 
subfamily a, 1 


0.76 


0.95 


1.24 


574- 
576 


IL7 


IL-7 


0.99 


1.23 


1.24 


319 


RNAC 


RNA cyclase homolog 


0.92 


1.13 


1.24 


676- 
677 


KIAA0365 


KIAA0365 gene product 


0.97 


1.2 


1.23 


679- 
681 


ARHGEF6 


KIAA0006 


1.04 


1.28 


1.23 


710- 
711 


BRE 


brain and reproductive organ- 
expressed (TNFRSFIA 
modulator) 


1.11 


1.35 


1.22 






Unknown 


0.9 


1.11 


1.22 


670- 
671 


PONl 


paraoxonase 1 


0.93 


1.14 


1.22 






ESTs, Weakly similar to 
138588 reverse transcriptase 
homolog DEI.sapiensl 


1.11 


1.36 


1.22 






ESTs 


0.72 


0.87 


1.21 




ATP7A 


ATPase, Cu-H- transporting, 
alpha polypeptide (Menkes 


0.99 


1.2 


1.21 
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syndiome) 












Unknown 


1.2 


1.45 


1.21 


735- 
736 


CRBl 


cnimbs Prosophila) homolog 
1 


0.88 


1.06 


1.21 


757- 
758 


ZNF211 


zinc fbger protein 211 


0.82 


0.99 


\2\ 






ESTs 


0.99 


1.16 


1.18 


685- 
686 


GABRP 


gamma-aminobutyric acid 
(GABA) A receptor, pi 


0.91 


. 1.07 


1.17 


687- 
688 


BMP6 


bone motphogenetic protein 6 


0.95 


1.1 


1.16 


587- 

588 


BAD 


BADH>bc6^nroapoptotic Bcl-2 
homolog 


1.11 


0.94 


0.85 


678 


NCSTN 


nicastrin 


1.13 


0.94 


0.83 


766- 
161 


F23149 1 


hypoHietical protein F23149 1 


1.11 


0.91 


0.82 


701- 
702 


FLJ12442 


hypodietical protein FIJ12442 


1.04 


0.85 


0.82 


589- 
591 


ZNF173 


acid finger protein 


1.16 


0.95 


0.81 


741- 
742 


GCAT 


glycine C-acetyltransfeiase (2- 
aniino-3-ketobutyrate 
cocDzyme A ligase) 


1.12 


0.91 


0.81 


786- 
787 


AFP 


a^ha-fetoprotein 


1.2 


0.96 


0.8 






IiPAK65=SER/THR-protBin 
kinase PAK-gamma =P21- 
activated kinase 3 


1.05 


0.84 


0.8 


747- 
748 


LOX 


lysyl oxidase 


0.93 


0.75 


0.8 


662- 
663 


HARS 


histidyl-fRNA synthetase 


0.73 


0.57 


0.79 


544 


SLC25A11 


solute carrier family 25 
(mitochondrial cairien 
oxoglutarate carrier), member 
11 


1.27 


1 


0.78 


697- 
698 


GTPBPl 


GP-l=putative G-protein 


0.97 


0.76 


0.78 


699- 
700 


GS2NA . 


nuclear autoantigen 


0.96 


0.75 


o:78' 


705- 
706 


KIAA0144 


KIAA0144 gene product 


0.87 


0.68 


0.78 






Unknown 


1.02 


0.79 


0.78 


733- 
734 


PPY2 


pancreatic polypeptide 2 


1.49 


1.16 


0.78 


653- 
654 


NAGA 


N-acetylgalactosaminidase, 
alpha- 


1.06 


0.82 


0.78 


583- 

584 


IL17R 


IL-17 receptor 


1.05 


0.82 


0.78 


657- 
658 


SLC9A1 


solute carrier family 9 
(sodium/hydrogen exchanger), 
isoform 1 (antiporter, Na+/H+, 
amiloride sensitive) 


0.97 


0.75 


0.77 


691- 
692 


RBBP4 


retinoblastoma-binding protein 
4 


0.87 


0.67 


0.76 


609- 
611 


ILK 


ILK=integiin-linked kinase 


0.88 


0.67 


0.76 


624- 


PTK2B 


protein tyrosine kinase PYK2 ' 


1.07 


0.81 


0.76 
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625 












504- 
506 


AKTl 


BPGF-l=bone-derived growfli 
factoi=Q6=quiescm=expression 
is induced by reversible growth 
arrest, trypsinization and serum 
starvation and is blocked by 
SV40 transfinmation 


0.75 


0.56 


0.75 


763- 
764 


DKFZP564C186 


DKFZP564C1 86 protein 


0.97 


0.73 


0.75 


554- 
556 


SCYB5 


ENA78=cliemokine 


0.34 


0.25 


0.75 


660- 
661 


MNATl 


menage a trois 1 (CAK 
assembly factor) 


1.24 


0.93 


0.75 


548- 
550 


TCaEB2 


Elongin B=KNA polymerase II 
transcription &ctor Sm pl8 
subunit 


0.98 


0.73 


0.74 


792- 

793 


RBBP2 


retinoblastoma-binding protein 
2 


1.47 


1.08 


0.74 


626- 
628 


MAP2K3 


Dual specificity mitogen- 
activated protein kinase kinase 
3 


1.06 


0.78 


0.74 


712- 
713 


RALY 


RNA-binding protein 
(autoantigenic) 


0.85 


0.62 


0.74 


743- 
744 


P14L 


similar to Bos taurus P14 

protein 


0.87 


0.64 


0.73 


731- 
732 


APMCTl 


APMCFl protein 


0.95 


0.7 


0.73 


674- 
675 


GNB2 


guanine nucleotide binding 
protein (G protein), beta 
polypeptide 2 


0.99 


0.73 


0.73 






Lyn;>hotoxin-Beta receptor 
precursor=Tumor necrosis 
factor receptor 2 related 
proteiiF^Tumor necrosis &ctor 
C receptor 


1.28 


0.93 


0.73 






ESTs 


1.02 


0.73 


0.72 


666- 
667 


MPI 


mannose phosphate isomerase 


1.22 


0.87 


0.71 


719- 
720 


PEF 


PEF protein widi a long N- 
terminal hydrophobic domain 
(peflin) 


1 


0.71 


- 

0.71 


651- 

652 


FDFTl 


&mesyl-dq>h0sphate 

famesyltransferase 1 


1.09 


0.77 


0.71 


739- 
740 


RAB3A 


RAB3A, member RAS 
oncogene fimuly 


0.8 


0.57 


0.71 






EST 


1.49 


1.05 


0.71 


621- 
623 


PPP2R5A 


phosphatase 2A B56-alpha 
(PP2A) 


0.89 


0.63 


0.71 


600- 
602 


GART 


Phosphoribosylglycinamide 
fonnyltransferase, 
phosphoribosylglycinamide 
synthetase, 

phosph<mbosylaminoimidazole 
synthetase 


0.82 


0.58 


0.7 


551- 
553 


MM23H1 


nm23-Hl=NDP kinase 
A=Nucleoside dephophate 
kinase A 


0.71 


0.49 


0.7 


655- 
656 


SECRET 


secretagogin 


0.84 


0.S8 


0.69 
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Unknown 


0.8 


0.55 


0.69 






EST 


1.19 


0.82 


0.69 


770- 
771 


FLJ22059 


hypothetical protein FLJ22059 


0.7 


0.49 


0.69 


659 


TUFM 


Tu translation elongation 
factor, mitochondrial 


1.16 


0.8 


0.69 


518- 
520 


RUNXl 


core binding &ctor alpbalb 
subimit=CBF 
alphal=PEBP2aAl 
transcription &ctor =AML1 
Proto-oncogene=translocated ia 
acute myeloid leukemia 


0.69 


0.47 


0.69 


585- 
586 


HGF 


Hepatocyte growth factor 
(hepapoietin A; scatter factor) 


0.88 


0.6 


0.68 






EST, Weakly similar to 
PRPP HUMAN SALIVARY 
PROLINE-RICH PROTEIN H- 
1 [H.sapiens] 


0.98 


0.67 


0.68 






Unknown 


1.02 


0.7 


0.68 






Human mRNA for unknown 
product, partial cds 


0.57 


0.39 


0.68 


618- 
620 


IL18R1 


ILrIi receptor l°IL-lRrp 


0.88 


0.6 


0.68 


510- 
512 


UBLl 


ubiquitin-homology domain 
protein PICl 


1.49 


1.01 


0.68 


703- 
704 


KIAA0218 


KIAA0218 gene product 


1.2 


0.81 


0.68 


760- 
761 


MAPREl 


microtubule-associated protein, 
RP/EB family, member 1 


0.6 


0.4 


0.67 


777 


SF3B4 


spUcing &ctor 3b, subunit 4, 
49kD 


1.17 


0.78 


0.67 






Unknown 


1.04 


0.7 


0.67 






ESTs 


0.42 


0.28 


.0.66 


533 


UBEl 


ubiquitin-activating enzyme El 
(A1S9T and BN75 tenqjerature 
sensitivity complementing) 


1.54 


1.01 


0.66 






Unknown 


1.23 


0.81 


0.66 


788- 
789 


COVAl 


cytosolic ovarian carcinoma 

antigen 1 


0.57 


0.37 


0.64 






Unknown 


0.82 


0.53 


o:64 


668- 
669 


TAGLN2 


transgelin 2 


0.7 


0.44 


0.64 


749- 
750 


PISD 


phosphatidylserine 
decaiboj^lase 


0.8 


0.5 


0.62 


775- 
776 


ILIB 


interleuMn 1, beta 


0.3 


0.17 


0.58 


429- 
430 


CALU 


calumenin 


0.7 


0.39 


0.56 


597- 
599 


PPPICB 


PPP 1 CB=Protein phosphatase 
1, catalytic subunit, beta 
isoform 


0.78 


0.44 


0.56 


751- 
752 




ESTs, Weakly sunilar to A 
Chain A, Cyclophilin A 
[H.sapiens] 


0.78 


0.43 


0.55 


745- 
746 


KIAA0008 


KIAA0008 gene product 


0.49 


0.27 


0.55 






Unknown 


2 


1.09 


0.54 


664- 


EIF4A1 


eukaryotic translation initiation 


0.7 


0.37 


0.53 
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665 




factor 4A, isofoim 1 








559- 
561 


S100A4 


SlOO calcium bmding protein 
A4=Placeiital calcium binding 
protein=Calvasculin=mtsl 
PROTEIN=CAPL 


2.28 


1.21 


0.53 


375 


PPIA 


peptidylpiolyl isomerase A 
(cyclophilinA) 


0.76 


0.39 


0.52 






EST 


2.56 


126 


0.49 
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Table 10. Markers that can be used to Classify BRCA1-Vik& from BRCA2-Vik& 
Tnmor Types using Compound Covariate Prediction Analysis. 



SEQ 

m 

NO: 


Gene 


Desciiptioii 


t- 
value 


Average 
Log ratios 
in BRCA2 
& 

BRCA2- 

Uke 
sporadics* 


Midpoint of 
average log- 
ratios in 
each class 


Average log ratios 
in 

BRCAl&BRCAl- 
like sporadic 


659 


TUFM 


Tu translation elongation 
fiictor, mitochondrial 


-10 


-0.09854 


-0.016 


0.067443 


749- 
750 


PISD 


phosphatidylserine 
decarboxylase 


8.2305 


-0.28567 


-0.187 


-0.08778 


745- 
746 


KIAA0008 


KIAA0008 gene product 


8.0421 


■0.56864 


-0.431 


-0.29414 


703- 
704 


KIAA0218 


KIAA021 8 gene product 


7.9288 


-0.08197 


-0.005 


0.071882 


751- 
752 


EST 


ESTs, Weakly similar to A 
Chain A, Cyclophilin A 
[H.sapiens] 


7.6225 


-0.34775 


-0J225 


-0.10292 


621- 
623 


PPP2R5A 


.phosphatase 2A B56-alpha 
(PP2A) 


-7.469 


-0.20343 


-0.121 


-0.03763 


733- 
734 


PPY2 


pancreatic polypeptide 2 


7.3866 


0.06558 


0.113 


0.160168 


649 


EST 


Unknown 


-7.384 


■0.27327 


-0.183 


-0.09313 


641 


EST 


EST, Weakly similar to 
PRPP^HUMAN 
SALIVARY PROLINE- 
RICH PROTEIN n-1 
[H.sapiens] 


7.3561 


-0.17457 


-0.095 


-0.01592 


375 


PPIA 


peptidylprolyl isomerase A 
(cyclophilin A) 


6.9946 


-0.38934 


-0.258 


-0.12668 


770- 
771 


FU22059 


hypothetical protein 
FLJ22059 


6.9726 


-0.31605 


-0.228 


-0.14026 


739 


RAB3A 


RAB3A member RAS 
oncogene family 


6.9458 


-0.23582 


-0.167 


-0.098 


655- 
656 


SECRET 


secretasosin 


6.9307 


-0 23657 


-0.147 


-0.0575 


629- 
631 


TNFR2RP 


Lymphotoxin-Beta receptor 
precursor=Tumor necrosis 
factor rec^tor 2 related 
protem==Tumor necrosis 
factor C receptor 


6.9268 


-0.02733 


0.038 


0.103462 


551- 
553 


MM23H1 


nm23-Hl=NDP kinase 
A-Nucleoside dephophate 
kinase A 


6.8307 


-0.32239 


-0.242 


-0.16241 


557- 
558 


PAK2 


hPAK65=SER/THR-protein 
kinase PAK-garoma =P21- 
activated kioase 3 


6.7214 


-0.1152 


-0.05 


0.01536 


806 


APRT 


adenine 

phosphoribosyltransferase 


6.6725 


0.044932 


0.085 


0.125156 


807- 
808 


PPP1R15A 


protein phosphatase 1, 
regulatory (inhibitor) 
subunit 15 A 


-6.648 


-0.21681 


-0.144 


-0.0716 


544 


SLC25A11 


solute carrier family 25 
(mitochondrial carrier; 
oxoglutarate carrier), 
member 11 


6.6083 


0.003461 


0.047 


0.089905 
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719- 
720 


PEF 


PEF protein with a long N- 
tmninal hydrophobic 
domain (peflin) 


6.6034 


-0.16368 


-0.088 


-0.01144 


747- 
748 


LOX 


lysyl oxidase 


6.4441 


-0.12784 


-0.077 


-0.02641 


775- 
776 


IL16 


interleukin l,beta 


6.4272 


-0.75203 


-0.«7 


-0.52288 


809- 
810 


MPHOSPHIO 


M-phase phosphoprotein 10 
(U3 sntiall nucleolar 
ribonucleoprotein) 


-6.425 


-0.10347 


-0.048 


0.007748 


653- 
654 


NAGA 


N-acetylgalactosaminidase, 
alpha- 


-6.42 


-0.09205 


-0.034 


0.024896 


760- 
761 


MAPREl 


microtubule-associated 
protein, RP/EB family, 
member 1 


-6.39 


-0.39254 


-0.296 


-0.19928 


811- 
812 


ARDl 


N-acetyltransferase, 
homolog of S. cerevisiae 
ARDl 


_ 

6.3833 


-0.1707 


-0.11 


-0.04915 


813- 
814 


ODCe 


CDC6 (cell division cycle 6, 
S. cerevisiae) homolog 


-6.371 


-0.25964 


-0.201 


-0.14327 


643 


EST 


EST 


6.3541 


0.133858 


0.253 


0.371068 


583- 
584 


IL17R 


IL-17 receptor 


6.3499 


-0.08991 


-0.035 


0.019532 


803 


WNT5B 


wingless-type MMTV 
integration site family, 
member 5B 


6.3391 


-0.06803 


-0.017 


0.035029 


651- 
652 


FDFTl 


fimesyl-diphosphate 
famesyltransferase 1 


6.3387 


-0.1152 


-0.038 


0.039414 


664- 
665 


EIF4A1 


eukaryotic translation 
initiation &ctor 4A, isoform 
1 


6.2705 


-0.39362 


-0.263 


-0.13253 


650 


EST 


Unknown 


6.2573 


-0.08355 


-0.002 


0.079181 


657- 
658 


SLC9A1 


solute carrier family 9 
(sodium/hydrogen 
exchanger), isoform 1 
(antiporter, Na+/H+, 
amiloride sraisitive) 


_ 

6.2571 


-0.13608 


-0.077 


-0.01682 


731- 
732 


APMCFl 


APMCFl protein 


6.2387 


-0.15677 


-0.083 


-0.00922 


503 


ZNF220 


zinc finger protein 220 


6.2316 


-0.13549 


-0.072 


-0.00922 


815- 
816 


PTPRCAP 


LPAP=lymphoid-restricted 
phosphoprotein==CD45 
phosphatase binding protein 
and putative substrate 


-6.229 


-0.10182 


-0.059 


-0.01592 


817 


P0LD3 


polymerase (DNA directed), 
delta 3 


-6.223 


-0.29843 


-0.221 


-0.14327 


788- 
789 


COVAl 


cytosolic ovarian carcinoma 
antigen 1 


6.1802 


-0.42946 


-0.321 


-0.21325 


701- 
702 


FU12442 


hypothetical protein 
FU12442 


6.1607 


-0.07676 


-0.027 


0.023252 


818 


EST 


Unknown 


0.1033 




-U.U J / 


U.Ui. 177 J 


721- 
722 


EST 


Human mRNA for 
unknown product, partial 
cds 


6.1032 


-0.40671 


-0.328 


-0.24949 


662- 


HARS 


histidyl-tRNA synthetase 




-0.24642 


-0.186 


-0.12552 
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.663 






6.0889 








RIO 


FU20746 


putative cyclin Gl 
interacting protein 


6.0827 


-0.04144 


0.017 


0.074451 


821 


PT IT 


pLK=homologue of 
DrosopMla polo 
serine/threonine kinase 


6,0725 


-0.32422 


-0.253 


-0.18177 


784 




trinucleotide repeat 
containing 12 


6.0744 


0.110253 


0.062 


0.01368 


723- 




syntrophin associated 
serine/threonine kinase 


6.0991 


0-107549 


0.065 


0.022016 


634- 


Col 


J30 IS 


6.1178 


0.048053 


-0.002 


-0.05306 


695- 
oyo 


oXVliJi 


PI-3-kinase-related kinase 

oM(jr-i 


6.1219 


0.146438 


0.087 


0.028571 


534- 


JLUUjI /OU 


B/K protem 


6.1306 


0.09691 


0.048 


-0.00174 


574- 

575 


TT n 


IL-7 


6.1558 


0.088845 


0.043 


-0.00305 


670- 
671 


PONl 


paiaoxonase 1 


6.3314 


0.058046 


0.014 


-0.03105 


592- 
593 


ZFP161 


ZF5=P0Z domain zinc 
finger protein 


6.3356 


0.196176 


0.101 


0.006466 


632- 
633 


EST 


ESTs, Moderately similar to 
ALU4_HUMANALU 
SUBFAMILY SB2 

U UlN OJti 

CONTAMINATION 
WARNING ENTRY 
[H.sapiens] 


6.3426 


0.028978 


-0.029 


-0.08725 


/OJ 




ESTs 


6.4363 


0.026533 


-0.03 


-0.08619 


822 


CCR6 


CCR6=STRI22==cliemokine 
receptor for MIP-3 
axpna/j^iVJvwisxoaus on 
activated B cells 


6.4897 


0.116276 


0.069 


0.022016 


681 


ARHGEF6 


KIAA0006 






U.UOD 




759 


LOC51605 


CGI-09 nrotein 




li.UUoO 






762 


FLJ10701 


hypothetical protein 
FLJ10701 


6.925 


0.135133 


0.075 


0,0141 


636 


EST 


Unknown 


7.3197 


0.050766 


' 0.005 


-0.04144 


647 


EST 


EST 


7.5484 


0.045714 


-0.012 


-0.07007 


725- 
726 


EST 


Unknown 


8.058 


0.068557 


-0.006 


-0.07988 



Table 11. Results of Compound Covariate Predictor Analysis. 



Expld 


Pre-specified class label 


Correctly classified 


B2-1 VS0SEB2-1 vsOSE 
21083 


1 


YES 


B2-10 vs OSE B2.10 vs OSE 
21085 


1 


YES 


B2-16 vs OSE B2-16 vs OSE 
21180 


1 


YES 


B2-2vsOSEB2-2vsOSE 
21090 


1 


YES 


B2-20 vs OSE B2-20 vs OSE 


1 


YES 
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21181 






B2-21 VSOSEB2-21 vsOSE 
21182 


\ : 


YES 


B2.22 vs OSE 32-22 vs OSE 
21183 


\ 


YES 


B2-23 vs OSE B2-23 vs OSE 
21091 


\ 


YES 


B2-24 vs OSE B2-24 vs OSE 
21092 


\ 


NO 


B2-25 vs OSE B2-25 vs OSE 
22038 


\ 


YES 


B2.3vsOSEB2.3vsOSE 
21093 


\ 


YES 


B2-4 vs OSE B2-4 vs OSE 

21094 


1 


YES 


B2-5vsOSEB2-5vsOSE 
21095 


1 


NO 


B2-7 vs OSE B2.7 vs OSE 
21096 


\ 


YES 


B2-8vsOSE B2-8vsOSE 
21097 


\ 


YES 


B2-9 vs OSEB2-9vsOSE 
21098 


\ 


YES 


ClOOvsOSEClOOvsOSE 
21167 


\ 


YES 


C102vsOSEC102vsOSE 
21168 


\ 


YES 


C103vsOSEC103vsOSE 
21169 


\ 


YES 


C105 vs OSEC105 vs OSE 
21178 


\ 


YES 


C107vsOSEC107vsOSE 
21099 


\ _ 


YES 


Clio vs OSE Clio vs OSE 
21101 


\ 


YES 


cm vsOSEClllvsOSE 
21102 


1 


NO 


C117vsOSEC117vsOSE 
21105 


1 


YES 


C118 vs OSE CI 18 vs OSE 
21106 




• 

YES 


C123 vs OSE C123 vs OSE 
21107 




YES 


C4o vs OSE C4o vs OSE 19741 




NO 


C77 vs OSE C77 vs OSE 21108 




YES 


C84 vs OSE C84 vs OSE 21368 




YES 


C85 vs OSE C85 vs OSE 21 179 




YES 


C99 vs OSE C99 vs OSE 21370 




YES 


B36 vs OSE B36 vs OSE 19680 


2 


YES 


B39 vs OSE B39 vs OSE 19682 


2 


YES 


B40 vs OSE B40 vs OSE 19683 


2 


YES 


B41 vs OSE B41 vs OSE 19684 


2 


YES 


B52-2 vs OSE B52-2 vs OSE 
19771 


2 


NO 


JDjH vs L^oH JtJj'f vs UoJd l70o / 


z 


YES 


B55 vs OSE B55 vs OSE 19688 


2 


YES 


B60 vs OSE B60 vs OSE 19690 


2 


YES 


B61 vs OSE B61 vs OSE 19695 


2 


YES 


B62 vs OSE B62 vs OSE 19701 


2 


YES 
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Tk^'X ire OQT7 Tl^l vc Ci^V 1 070/? 
DOO VS L^oJD dOd VS Il^oI!* ly /UO 


2 


YES 




2 


YES 


R7n vc HQTJ U7n vc OQT? 1 0770 


2 


YES 


l3/*f VS Uoli Jd/4 VS Vy 1 LI 


2 


YES 


B17 VS OSE B77 vs OSE 19731 


2 


YES 


r> /o VS Ubii B7o VS UbH 21103 


2 


YES 


r> /y VS UoJd B/y VS UoJb 19743 


2 


YES 


DO\) VS Uoxi BoU VS UoJl zlUoo 


2 


YES 


CI 14 VS UoEt 14 VS Uob 
xllU*f 


2 


YES 


1^10 VS KJoEt v^lj VS Uoxl ly /j4 


2 


YES 


v^io VS uoJts uio VS wois ly / j j 


2 


YES 


r^17 xra AQT7 /^17 r\QX* IQTa^C 

\^1 / VS UoH V-/1 / VS Uoii ly /30 


2 


YES 


UlVS UoiS 1^1 VS Uoxi ly/32 


2 


YES 


UzU VS Uoc VS UoJi 19737 


2 


YES 


\J41 VS UC>li IJ41 VS Uoli 19739 


2 


YES 


C4z VS (JiSli (J4Z VS Obli 19740 


2 


YES 


C49 VS OSE C49 vs OSF 1 974? 


z 


YSS 


C79 VS OSE C79 vs OSE 21367 


2 


YES 


C87 VS OSE C87 vs OSE 19744 


2 


YES 


C95 VS OSE C95 vs OSE 21369 


2 


YES 




Overall Success 


91.80% 
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